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Diagnosis and Treatment of Vascular Disease 

Background of the Invention 

Cardiovascular disease is a major health risk throughout the industrialized world. 
Coronary artery disease (CAD), or atherosclerosis, involves the progressional narrowing of 
the arteries due to a build-up of atherosclerotic plaque. Myocardial infarction (MI), e.g., 
heart attack, results when the heart is damaged due to reduced blood flow to the heart caused 
by the build-up of plaque in the coronary arteries. 

Coronary artery disease, the most prevalent of cardiovascular diseases, is the 
principal cause of heart attack, stroke, and gangrene of the extremities, and thereby the 
principle cause of death in the United States. Coronary artery disease, or atherosclerosis, is a 
complex disease involving many cell types and molecular factors (described in, for example, 
Ross, 1993, Nature 362: 801-809). The process, in normal circumstances a protective 
response to insults to the endothelium and smooth muscle cells (SMCs) of the wall of the 
artery, consists of the formation of fibrofatty and fibrous lesions or plaques, preceded and 
accompanied by inflanmiation. The advanced lesions of atherosclerosis may occlude the 
artery concerned, and result from an excessive inflammatory-fibroproliferative response to 
nximerous different forms of insult. Injury or dysfunction of the vascular endothelium is a 
common feature of may conditions that predispose a subject to accelerated development of 
atherosclerotic cardiovascular disease. For example, shear stresses are thought to be 
responsible for the frequent occurrence of atherosclerotic plaques in regions of the 
circulatory system where turbulent blood flow occurs, such as branch points and irregular 
structures. 

The first observable event in the formation of an atherosclerotic plaque occurs when 
blood-bome monocytes adhere to the vEiscular endothelial layer and transmigrate through to 
the sub-endothelial space. Adjacent endothelial cells at the same time produce oxidized low 
density lipoprotein (LDL). These oxidized LDLs are then taken up in large amounts by the 
monocytes through scavenger receptors expressed on their surfaces. In contrast to the 
regulated pathway by which native LDL (nLDL) is taken up by nLDL specific receptors, the 
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scavenger pathway of uptake is not regulated by the monocytes. 

These lipid-filled monocytes are called foam cells, and are the major constituent of 
the fatty streak. Interactions between foam cells and the endothelial and SMCs which 
surround them lead to a state of chronic local inflammation which can eventually lead to 
5 smooth muscle cell proliferation and migration, and the formation of a fibrous plaque. 

Such plaques occlude the blood vessel concerned and, thus, restrict the flow of blood, 
resulting in ischemia. Ischemia is a condition characterized by a lack of oxygen supply in 
tissues of organs due to inadequate perfusion. Such inadequate perfusion can have a number 
of natural causes, including atherosclerotic or restenotic lesions, anemia, or stroke. Many 
10 medical interventions, such as the interruption of the flow of blood during bypass sxjrgery, 
for example, also lead to ischemia. In addition to sometimes being caused by diseased 
cardiovascular tissue, ischemia may sometimes affect cardiovascular tissue, such as in 
ischemic heart disease. Ischemia may occur in any organ, however, that is suffering a lack of 
oxygen supply. 

15 One of the most important risk factors for coronary artery disease is a familial 

history. Although family history subsumes both genetic and shared environmental factors, 
studies suggest that CAD has a very strong genetic component (Marenberg, et al. (1994) 
AE/M 330: 1041). Despite the importance of family history as a risk factor for CAD, it's 
incomplete genetic basis has not been elucidated. Therefore, the identification of genes 

20 which are involved in the development of CAD and MI would be beneficial. 

It would thus be beneficial to identify polymorphic regions within genes which are 
associated with a vascular disease or disorder, such as coronary artery disease or myocardial 
infarction. It would further be desirable to provide prognostic, diagnostic, 
pharmacogenomic, and therapeutic methods utilizing the identified polymorphic regions. 

25 

Summary of the Invention 

The present invention is based, at least hi part, on the identification of polymorphic 
regions within the thrombospondin 2 (THBS2) gene, angiotensin converting enzyme 1 
(ACE) gene, and the beta fibrinogen (FGB) gene which are associated with specific diseases 
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or disorders, including vascular diseases or disorders. In particular, single nucleotide 
polymorphisms (SNPs) in these genes which are associated with premature coronary artery 
disease (CAD) (or coronary heart disease) and myocardial infarction (MI) have been 
identified. SNPs in these genes, as identified herein, singly or in combination, can be 
5 utilized to predict, in a subject, a decreased risk for developing a vascular disease, e.g., CAD 
and/or ML 

The SNPs identified herein may further be used in the development of new treatments 
for vascular disease based upon comparison of the variant and normal versions of the gene or 
gene product {e.g., the reference sequence), and development of cell-culture based and 
M= 10 animal models for research and treatment of vascular disease. The invention further relates 

, 3 to novel compoimds and pharmaceutical compositions for use in the diagnosis and treatment 

I. .i 

;■ J of such disorders. In preferred embodiments, the vascular disease is CAD or MI. 

*' ^' The polymorphisms of the invention may thus be used, both singly or in combination, 

,,5 in prognostic, diagnostic, and therapeutic methods. For example, the polymorphisms of the 

j^^ .^ 15 invention can be used to determine whether a subject is or is not at risk of developing a 
I ^ disease or disorder associated with a specific allelic variant of a THBS2, ACE, or FGB 

:p polymorphic region, e.g., a disease or disorder associated with aberrant THBS2, ACE, or 

i!I FGB activity, e.g., a vascular disease or disorder such as CAD or MI. 

The invention thus relates to isolated nucleic acid molecules and methods of using 
20 these molecules. The nucleic acid molecules of the invention include specific THBS2, ACE, 
or FGB allelic variants which differ from the reference THBS2, ACE, or FGB sequences set 
forth in SEQ ID NO:l (GI 307505), SEQ ID N0:3 (GI 13027555), or SEQ ID NO:5 (GI 
182597), respectively, or a portion thereof. The preferred nucleic acid molecules of the 
invention comprise THBS2, ACE, or FGB polymorphic regions or portions thereof having 
25 the polymorphisms shown in Tables 1, 4, and 6 (corresponding to SEQ ID NOs.:7, 8, 9, 10, 
and 11), polymorphisms in linkage disequilibrium with the polymorphisms shown in Tables 
1, 4, and 6, and combinations thereof. Nucleic acids of the invention can function as probes 
or primers, e.g„ in methods for determining the allehc identity of a THBS2, ACE, or FGB 
polymorphic region in a nucleic acid of interest. 
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The nucleic acids of the invention can also be used, singly or in combination, to 
determine whether a subject is or is not at risk of developing a disease associated with a 
specific allelic variant of a THBS2, ACE, or FGB polymorphic region, e.g., a disease or 
disorder associated with aberrant THBS2, ACE, or FGB activity, e.g., a vascular disease or 
5 disorder such as CAD or MI. The nucleic acids of the invention can further be used to 
prepare THBS2, ACE, or FGB polypeptides encoded by specific alleles, such as mutant 
(variant) alleles. Such polypeptides can be used in therapy. Polypeptides encoded by 
specific THBS2, ACE, or FGB alleles, such as variant THBS2, ACE, or FGB polypeptides, 
can also be used as immimogens and selection agents for preparing, isolating or identifying 

10 antibodies that specifically bind THBS2, ACE, or FGB proteins encoded by these alleles. 
Accordingly, such antibodies can be used to detect variant THBS2, ACE, or FGB proteins. 

There are two preferred SNPs in the THBS2 gene. One polymorphism fovind in the 
THBS2 gene in the population screened is a change from a thymidine (T) to a guanine (G), 
or the complement thereof, in the THBS2 gene at residue 3949 of the reference sequence GI 

15 307505 (polymorphism ID No. G5755e5). A second polymorphism in the THBS 2 gene is a 
change firom a thymidine (T) to a cytidine (C), or the complement thereof, at residue 4476 of 
the reference sequence GI 307505 (polymorphism ID No. G5755e9). These polymorphisms 
are located in the 3' mtranslated region (UTR) of the THBS2 gene, and therefore do not 
result in a change in the amino acid sequence of the THBS2 protein, 

20 There is one preferred SNP in the ACE gene. This SNP, identified herein as G765u2, 

is a change firom an adenine (A) to a guanine (G), or the complement thereof, at nucleotide 
residue 86408 of the ACE reference sequence GI 13027555. This SNP is a "silent" variant. 
That is, it does not result in a change in the amino acid sequence of the ACE protein. 

There are two preferred SNPs in the FGB gene. One SNP, referred to herein as 

25 FGBul , is a change fi-om a cytidine (C) to a thymidine (T), or the complement thereof, at 
nucleotide residue 5119 of the FGB reference sequence GI 182597. This SNP is a silent 
variant. The second SNP, FGBu4, is a change firom a guanine (G) to an adenine (A), or the 
complement thereof, at nucleotide residue 8059 in the reference sequence GI 182597. This 
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polymorphism is a missense variation which results in a change from an arginine (R) to a 
lysine (K) in the anaino acid sequence of FGB (SEQ ID N0:6) at amino acid residue 478. 

The nucleic acid molecules of the invention can be double- or single-stranded. 
Accordingly, in one embodiment of the invention, a complement of the nucleotide sequence 
5 is provided wherein the polymorphism has been identified. For example, where there has 
been a single nucleotide change from a thymidine to a cytidine in a single strand, the 
complement of that strand will contain a change from an adenine to a guanine at the 
corresponding nucleotide residue. The invention ftirther provides allele-specific 
oligonucleotides that hybridize to a gene comprising a polymorphism of the present 
10 invention or to its complement. 

The polymorphisms of the present invention, either singly, in combination with each 
other, or in combination with previously identified polymorphisms, are shown herein to be 
associated with specific disorders, e.g., vascular diseases or disorders. Examples of vascular 
diseases or disorders include, without limitation, atherosclerosis, coronary artery disease 
15 (CAD), myocardial infarction (MI), ischemia, stroke, peripheral vascular diseases, venous 
thromboembolism and pulmonary embolism. 

The invention further provides vectors comprising the nucleic acid molecules of the 
present invention; host cells transfected with said vectors whether prokaryotic or eukaryotic; 
and transgenic non-human animals which contain a heterologous form of a ftmctional or non- 
20 functional THBS2, ACE, or FGB allele described herein. Such a transgenic animal can serve 
as an animal model for studying the effect of specific THBS2, ACE, or FGB allelic 
variations, including mutations, as well as for use in drug screening and/or recombinant 
protein production. 

In another preferred embodiment, the method comprises determining the nucleotide 
25 content of at least a portion of a THBS2, ACE, or FGB gene, such as by sequence analysis. 
In yet another embodiment, determining the molecular structure of at least a portion of a 
THBS2, ACE, or FGB gene is carried out by single-stranded conformation polymorphism 
(SSCP). In yet another embodiment, the method is an oligonucleotide ligation assay (OLA). 
Other methods within the scope of the invention for determining the molecular structure of 
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at least a portion of a THBS2, ACE, or FGB gene include hybridization of 2tllele-specific 
oligonucleotides, sequence specific amplification, primer specific extension, and denaturing 
high performance liquid chromatography (DHPLC). In at least some of the methods of the 
invention, the probe or primer is allele specific. Preferred probes or primers are single 
5 stranded nucleic acids, which optionally are labeled. 

The methods of the invention can be used for determining the identity of a nucleotide 
or amino acid residue within a polymorphic region of a hummi THBS2, ACE, or FGB gene 
present in a subject. For example, the methods of the invention can be usefixl for determining 
whether a subject is or is not at risk of developing a disease or condition associated with a 
10 specific allelic variant of a polymorphic region in the human THBS2, ACE, or FGB gene, 

•"issf 

!;:! e.g,, a vascular disease or disorder. 

• .J In one embodiment, the disease or condition is characterized by an aberrant THBS2, 

^ ACE, or FGB activity, such as aberrant THBS2, ACE, or FGB protein level, which can result 

=F from aberrant expression of a THBS2, ACE, or FGB gene. The disease or condition can be 

.-sr 15 CAD, MI, or another vascular disease. Accordingly, the invention provides methods for 
.1 predicting a subject's risk for developing a vascular disease associated with aberrant THBS2, 

;t ACE, or FGB activity. In a preferred embodiment, a subject having "pattern 1," which 

5 

^ comprises two copies of the variant allele of G5755e9 (CC) in combination with two copies 

of the reference allele of G5755e5 (TT), or the complement thereof, or "pattern 2", which 
20 comprises two copies of the reference allele of G5755e9 (TT) and two copies of the variant 
allele of G5755e5 (GG), or the complement thereof, is at a approximately 3-fold decreased 
odds of vascular disease compared to all other combinations of genotypes at these two loci. 

In another preferred embodiment, a subject having one copy of an A and one copy of 
a G at nucleotide 86408 of the ACE reference sequence GI 13027555 (AG genotype), or the 
25 complement thereof, is at a decreased risk for vascular disease relative to persons with other 
genotypes for this SNP (e.g., AA or GG genotypes). 

In yet another preferred embodiment, a subject having two copies of a T at nucleotide 
residue 5119 of the FGB reference sequence GI 182597, or the complement thereof, is at a 
3-fold decreased risk for vascular disease relative to persons with the CC genotype. A 
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subject having one copy of a T and one copy of a C at nucleotide residue 51 19 of the FGB 
reference sequence GI 182597, or the complement thereof, is also at a decreased risk for 
vascular disease relative to persons with the CC genotype. 

In still another preferred embodiment, a subject having two copies of an A at 
5 nucleotide residue 8059 of the FGB reference sequence GI 182597, or the complement 
thereof, is at a ~3-fold decreased risk for vascular disease relative to persons with the GG 
genotype. A subject having one copy of an A and one copy of a G at nucleotide residue 5119 
of the FGB reference sequence GI 182597, or the complement thereof, is also at a decreased 
risk for vascular disease relative to persons with the GG genotype (see Example 1). 
i f 10 Additionally, the invention provides a method of identifying a subject who is or is not 

Q susceptible to a vascular disorder, which method comprises the steps of i) providing a nucleic 

i, JL 

•' a acid sample from a subject; and ii) detecting in the nucleic acid sample the presence or 

absence of a THBS2, ACE, or FGB gene polymorphism, or both in combination, that 
•'P correlate with the vascular disorder with a P value less than or equal to 0.05. 

1 .4 1 5 The invention further provides forensic methods based on detection of 

; ^ polymorphisms within the THBS2, ACE, or FGB gene. 

F The invention also provides probes and primers comprising oligonucleotides, which 

l-i correspond to a region of nucleotide sequence which hybridizes to at least 6 consecutive 

nucleotides of the sequence set forth as SEQ ID NOs.:7, 8, 9, 10, and 1 1 or to the 
20 complement of the sequences set forth as SEQ ID N0s.:7, 8, 9, 10, and 1 1, or naturally 

occurring mutants or variants thereof. In preferred embodiments, the probe/primer further 
includes a label attached thereto, which is capable of being detected. 

A kit of the invention can be used, e.g., for determining whether a subject is or is not 
at risk of developing a disease associated with a specific allelic variant of a polymorphic 
25 region of a THBS2, ACE, or FGB gene, e.g., a vascular disease, e,g., CAD or MI. In a 

preferred embodiment, the invention provides a kit for determining whether a subject is or is 
not at risk of developing a vascular disease such as, for example, atherosclerosis, CAD, MI, 
ischemia, stroke, peripheral vascular diseases, venous thromboembolism and pulmonary 
embolism. The kit of the invention can also be used in selecting the appropriate clinical 
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course of clinical treatment to a subject to treat a disease or condition, such as a disease or 
condition set forth above. Thus, determining the allelic variants of THBS2, ACE, or FGB 
polymorphic regions of a subject can be useful in predicting how a subject will respond to a 
specific drug, e.g., a drug for treating a disease or disorder associated with aberrant THBS2, 
5 ACE, or FGB, e.g., a vascular disease or disorder. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

Brief Description of the Figures 

10 Figure 1 depicts the nucleotide sequence corresponding to reference sequence GI 

307505 (SEQ ID NO:l) for the THBS2 gene. 

Figure 2 depicts the amino acid sequence corresponding to reference GI 4507487 
(SEQ ID N0:2) for the THBS2 protein. 

Figure 3 depicts the nucleotide sequence corresponding to reference sequence GI 
15 13027555 (SEQ ID N0:3) for the ACE gene. 

Figure 4 depicts the amino acid sequence corresponding to reference GI 4503273 
(SEQ ID N0:4) for the ACE protein. 

Figure 5 depicts the nucleotide sequence corresponding to reference sequence GI 
182597 (SEQ ID NO:5) for the FGB gene. 
20 Figure 6 depicts the amino acid sequence corresponding to reference GI 11 761 63 1 

(SEQ ID N0:6) for the FGB protein. 

Detailed Description of the Invention 

The present invention is based, in part, on the identification of polymorphic regions 
25 within the thrombospondin 2 (THBS2) gene, the angiotensin converting enzyme 1 (ACE) 

gene, and the beta fibrinogen (FGB) gene. The polymorphic regions of the invention contain 
polymorphisms which correlate with specific diseases or conditions, including vascular 
diseases or disorders, including, but not limited to, atherosclerosis, coronary artery disease 
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(CAD), myocardial infarction (MI), ischemia, stroke, peripheral vascxilar diseases, venous 
thromboembolism and pulmonary embolism. 

THBS2 

5 Two SNPs in the THBS2 gene have been identified which are associated with 

vascular disease, e.g., CAD and ML The first THBS2 SNP, referred to herein as G5755e5, is 
a change from a thymidine (T) to a guanine (G) in the THBS2 gene at residue 3949 of the 
reference sequence GI 307505. The second THBS2 SNP, referred to herein as G5755e9, is a 
change firom a thymidine (T) to a cytidine (C) in the THBS2 gene at residue 4476 of the 
10 reference sequence GI 307505. These SNPs are within the 3' untranslated region of the 
THBS2 gene. Therefore, they do not result in a change in the amino acid sequence of the 
THBS2 protein. 

The valiant allele, G, of the THBS2 SNP G5755e5, was previously shown to be 

associated with vascular disease, e.g., Ml and CAD. Individuals homozygous for the variant 
15 allele (GG) are at greater than 2-fold decreased odds of having vascular disease. 

Homozygous carriers of the variant allele of the G5755e9 SNP (CC) also showed a ~3-fold 

decreased odds of vascular disease. 

These two SNPs, G5755e5 and G5755e9, are in significant negative linkage 

disequilibrium with each other (D'=.49 (-), p=.04). The two SNPs together reveal distinct 
20 patterns of risk. Pattern 1 comprises two copies of the variant allele of G5755e9 (CC) in 

combination with two copies of the reference allele of G5755e5 (TT). Pattern 2 comprises 

two copies of the reference allele of G5755e9 (TT) and two copies of the variant allele of 

G5755e5 (GG). Patterns 1 and 2 may independently influence risk of vascular disease. 

Individuals who have pattern 1 or pattern 2 are at ~3-fold decreased odds of vascular disease 
25 relative to persons with any other combination of genotypes for these two SNPs (odds 

ratio=0.32, p=.001). Thus, individuals with pattern 1 or pattern 2 are protected against 

vascular disease, e.g., CAD and/or MI. 
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ACE 

A SNP in the ACE gene, identified herein as G765u2, has been identified which is 
also associated with a decreased risk of vascular disease, e.g., MI and CAD, in a subject. 
The G765u2 SNP is a change from an adenine (A) to a guanine (G) at nucleotide residue 
5 86408 of the ACE reference sequence GI 13027555. This SNP is a "silent" variant. That is, 
it does not result in a change in the amino acid sequence of the ACE protein. Individuals 
with one copy of an A (the reference allele) and one copy of a G (the variant allele) at 
nucleotide residue 86408 of the ACE reference sequence GI 13027555 (AG genotype) are at 
a decreased risk for vascular disease, e.g., CAD or MI (CAD odds ratio:0.71; MI odds ratio: 
10 0.66) relative to persons with other genotypes for this SNP (e.g.^ AA or GG genotypes).. 

Thus, individuals with this genotype are protected against vascular disease, e.g. CAD and/or 
MI. 

An insertion/deletion polymorphism in the ACE gene was previously associated with 
vascular disease, e.g., associated with a decreased risk for MI (as described in Cambien F, et 

15 al. (1992) Nature 359: 641-644, incorporated herein in its entirety by reference). The 

G765u2 SNP may be found to be in linkage disequilibrium with the previously identified 
insertion/deletion polymorphism. If these two polymorphisms are in linkage disequilibrium 
(LD), the G765u2 SNP would act as a marker for the insertion/deletion polymorphism. 
Regardless of LD between these two polymorphisms, the G765u2 SNP represents a novel 

20 association with vascular disease. 

FGB 

Two SNPs in the FGB gene, identified herein as FGBul and FGBu4, have been 
identified which are associated with a decreased risk of vascular disease, e.g., CAD and/or 
25 ML The first SNP, FGBul, is a change firom a cytidine (C) to a thymidine (T) at nucleotide 
residue 51 18 of the FGB reference sequence GI 182597. This SNP is a silent variant. The 
second SNP, FGBu4, is a change jfrom a guanine (G) to an adenine (A) at nucleotide residue 
8059 in the reference sequence GI 182597. This polymorphism is a missense variation 
which results in a change firom an arginine (R) to a lysine (K) in the amino acid sequence of 
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FGB (SEQ ID N0:6) at amino acid residue 478. For the FGBul SNP, individuals with two 
copies of a T (the variant allele) at nucleotide residue 5 1 1 9 of the FGB reference sequence GI 
182597 are at a decreased risk for vascular disease, e.g., CAD or MI (CAD odds ratio: 0.28; 
MI odds ratio: 0.43) relative to persons with the CC genotype. Individuals with one copy of 
5 a T and one copy of a C (the reference allele) at nucleotide residue 5119 of the FGB 

reference sequence GI 182597 are also at a decreased risk for vascular disease, e.g., CAD or 
MI (CAD odds ratio: 0.66; MI odds ratio: 0.72) relative to persons with the CC genotype. 
Thus, individuals with the TT or CT genotype at nucleotide residue 51 19 of the FGB 
reference sequence GI 182597 are protected against vascular disease, e.g. CAD and/or MI. 
10 For the FGBu4 SNP, individuals with two copies of an A (the variant allele) at 

i;;3 nucleotide residue 8059 of the FGB reference sequence GI 182597 are at a decreased risk for 

=j vascular disease, e.g., CAD or MI (CAD odds ratio: 0.28; MI odds ratio: 0.43) relative to 

.1^ persons with the GG genotype. Individuals with one copy of an A and one copy of a G (the 

'=F reference allele) at nucleotide residue 51 19 of the FGB reference sequence GI 182597 are 

15 also at a decreased risk for vascular disease, e.g, CAD or MI (CAD odds ratio: 0.61 ; MI 

i: :: : 
J ; 3 

j I odds ratio: 0.66) relative to persons with the GG genotype. Thus, individuals with the AA 

III; or GA genotype at nucleotide residue 8059 of the FGB reference sequence GI 182597 are 

also protected against vascular disease, e.g. CAD and/or MI. 

Other variants including one in the promoter region of the FGB gene at nucleotide 
20 residue -455 (as described in Shea S, et al {\999) Am J Epidemiol, 159:737-46, incorporated 
herein in its entirely by reference), have been previously associated with vascular disease, 
e.g, CAD and MI, The FGBul and FGBu4 SNPs may be found to be in linkage 
disequilibrium with these previously identified SNPs. If these SNPs are in linkage 
disequilibrium (LD), the FGBul and FGBu4 SNPs would act as markers for the previously 
25 identified SNPs. Regardless of LD, the FGBul and FGBu4 SNPs represent novel 
associations with vascular disease. 

The polymorphisms of the present invention are single nucleotide polymorphisms 
(SNPs) at a specific nucleotide residue within the THBS2 gene, the ACE gene, and FGB 
gene. The THBS2 gene, the ACE gene, and FGB gene have at least two alleles, referred to 
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herein as the reference allele and the variant allele. The reference alleles (i.e,, the consensus 
sequences) have been designated based on their frequency in a general United States 
Caucasian population sample. The reference allele is the more common of the two alleles; 
the variant allele is the more rare of the two alleles. Nucleotide sequences in GenBank may 
5 correspond to either allele and correspond to the nucleotide sequence of the nucleotide 

sequence which has been deposited in GenBank™ and given a specific Accession Number 
(e.g., GI 307505, the reference sequence for the THBS2 gene, GI 13027555, the reference 
sequence for the ACE gene, and GI 182597, the reference sequence for the FGB gene, 
corresponding to SEQ ID NO:l, SEQ ID NO:3, and SEQ ID NO:5, respectively). The 
J 10 reference sequence for the amino acid sequences of THBS2, ACE, and FGB proteins are set 

1:3 forth as SEQ ID NO:2, SEQ ID NO:4, and SEQ ID N0:6, respectively. The variant allele 

differs from the reference allele by at least one nucleotide at the site(s) identified in Tables 1, 
4, and 6 (see Example 1, below), and those in linkage disequilibrium therewith. The present 

=P invention thus relates to nucleotides comprising variant alleles of the THBS2, ACE, and/or 

15 FGB reference sequences, and/or complements of the variant alleles to be used singly or in 

1 1 combination with each other. 

;:r The invention further relates to nucleotides comprising portions of the variant sdleles 

' - and/or portions of complements of the variant alleles which comprise the site of the 

polymorphism and are at least 5 nucleotides or basepairs in length. Portions can be, for 
20 example, 5-10, 5-15, 10-20, 2-25, 10-30, 10-50 or 10-100 bases or basepairs long. For 

example, a portion of a variant allele which is 17 nucleotides or basepairs in length includes 
the polymorphism (i.e., the nucleotide(s) which differ from the reference allele at that site) 
and twenty additional nucleotides or basepairs which flank the site in the variant allele. 
These additional nucleotides and basepairs can be on one or both sides of the polymorphism. 
25 Polymorphisms which are the subject of this invention are defined in Tables 1 , 4, and 6 with 
respect to the reference sequences identified in Tables 1, 4, and 6 (GI 307505, GI 13027555, 
and GI 182597), and those polymorphisms in linkage disequilibrium with the polymorphisms 
of Tables 1, 4, and 6. For example, the invention relates to nucleotides comprising a portion 
of the THBS2 gene having a nucleotide sequence of GI 307505 (SEQ ID NO: 1), or a portion 
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thereof, comprising a polymorphism at a specific nucleotide residue (e.g., a guanine at 
nucleotide residue 3949 of GI 307505 or a cytidine at nucleotide residue 4476, or the 
complement thereof), nucleotides comprising a portion of the ACE gene having a nucleotide 
sequence of GI 13027555 (SEQ ID NO:3), or a portion thereof, comprising a polymorphism 
5 at a specific nucleotide residue (e.g., a guanine at residue 86408, or the complement thereof), 
or nucleotides comprising a portion of the FGB gene having a nucleotide sequence of GI 
182597 (SEQ ID NO: 5), or a portion thereof, comprising a polymorphism at a specific 
nucleotide residue (e.g., a thymidine at residue 51 19 or an adenine at residue 8059, or the 
complement thereof). 

It 10 Specific reference nucleotide (SEQ ID NO: 1 ) and amino acid (SEQ ID NO: 2) 

■J a:r 

P sequences for THBS2 are shown in Figures 1 and 2, respectively. Specific reference 

•sj nucleotide (SEQ ID NO:3) and amino acid (SEQ ID NO: 4) sequences for ACE are shown in 

Figures 3 and 4, respectively. Specific reference nucleotide (SEQ ID NO:5) and amino acid 

• y 

(SEQ ID NO: 6) sequences for FGB are shown in Figures 5 and 6, respectively. It is 

it 

15 understood that the invention is not limited by these exemplified reference sequences, as 

variants of these sequences which differ at locations other than the SNP sites identified 

herein can also be utilized. The skilled artisan can readily determine the SNP sites in these 
H other reference sequences which correspond to the SNP sites identified herein by aligning the 

sequence of interest with the reference sequences specifically disclosed herein. Programs for 
20 performing such alignments are commercially available. For example, the ALIGN program 

in the GCG software package can be used, utilizing a PAM120 weight residue table, a gap 

length penalty of 12 and a gap penalty of 4, for example. 

The polymorphic region of the present invention is associated with specific diseases 

or disorders and has been identified in the human THBS2, ACE, and FGB genes by 
25 analyzing the DNA of human populations. In particular, 352 U.S. Caucasian gene by 

analyzing the DNA of cell lines derived from an ethnically diverse population by methods 

described in Cargill, et al (1999) Nature Genetics 22:231-238. 

Cases which were used to identify associations between vascular disease and SNPs 

were comprised of 352 U.S. Caucasian subjects with premature coronary artery disease were 
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identified in 15 participating medical centers, fulfilling the criteria of either myocardial 
infarction, surgical or percutaneous revascularization, or a significant coronary artery lesion 
diagnosed before age 45 in men or age 50 in women and having a living sibling who met the 
same criteria. These cases were compared with a random sample of 418 Caucasian controls 
5 drawn from the general U.S. population in Atlanta, Georgia. 

The allelic variants of the present invention were identified by performing denaturing 
high performance liquid chromatography (DHPLC) analysis, variant detector arrays 
(Affymetrix™), the polymerase chain reaction (PGR), and/or single stranded conformation 
polymorphism (SSCP) analysis of genomic DNA firom independent individuals as described 
10 in the Examples, using PGR primers complementary to intronic sequences surrounding each 
of the exons, 3' UTR, and 5' upstream regulatory element sequences of the THBS2, ACE, 
and FGB genes. 

The presence of at least one polymorphism in the ACE gene in the population studied 
was identified was identified and at least two polymorphisms in the THBS2, and FGB genes 

15 in the population studied were identified. Both of the variants are characterized as single 

nucleotide polymorphisms (SNPs). The preferred polymorphisms of the invention are listed 
in Tables 1, 4, and 6. 

Tables 1, 4, and 6 contains a "polymorphism ID No." in column 2, which is used 
herein to identify each individual variant. In Tables 1, 4, and 6, the nucleotide sequence 

20 flanking each polymorphism is provided in column 9, wherein the polymorphic residue(s), 
having the variant nucleotide, is indicated in lower-case letters. There are 15 nucleotides 
flanking the polymorphic nucleotide residue (/.^., 15 nucleotides 5' of the polymorphism and 
15 nucleotides 3' of the polymorphism). Column 10 indicates the SEQ ID NO. that is used to 
identify each polymorphism. SEQ ID NOs.:7, 8, 9, 10, and 1 1 comprise sequences shown in 

25 column 9 with the variant nucleotide at the residue(s) shown in lower-case letters. 

Each polymorphism is identified based on a change in the nucleotide sequence from 
a consensus sequence, or the "reference sequence." To identify the location of each 
polymorphism in Tables 1, 4, and 6, a specific nucleotide residue in a reference sequence is 
listed for each polymorphism, where nucleotide residue number 1 is the first 5') 
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nucleotide in GI 307505 (the reference sequence for the THBS2 gene, corresponding to SEQ 
ID NO:l), the first nucleotide in GI 13027555 (the reference sequence for the ACE gene, 
corresponding to SEQ ID NO:3), and the first nucleotide in GI 182597 (the reference 
sequence for the FGB gene, corresponding to SEQ ID NO:5). Column 8 lists the reference 
5 sequence and polymorphic residue for each polymorphism. 

Column 4 describes the type of variant for each SNP. The SNPs of the present 
invention result in either a silent variant, a missense variant, or a 3' untranslated region 
variant. For example, as can be seen in Tables 1, 4, and 6, both THBS2 SNPs (G5755e5 and 
G5755e9) are located in the 3' UTR of the THBS2 gene. The ACE SNP (G765u2) is a silent 
10 variant. The FGBul SNP in the FGB gene is also a silent variant. The FGBu4 SNP in the 
FGB gene results in a change from an arginine (R) to a lysine (K). Therefore, this SNP is 
identified as a missense SNP. 

The nucleic acid molecules of the invention can be double- or single-stranded. 
Accordingly, the invention further provides for the complementary nucleic acid strands 
15 comprising the polymorphisms listed in Tables 1, 4, and 6. 

The invention further provides allele-specific oligonucleotides that hybridize to a 
gene comprising a single nucleotide polymorphism or to the complement of the gene. Such 
oligonucleotides will hybridize to one polymorphic form of the nucleic acid molecules 
described herein but not to the other polymorphic form(s) of the sequence. Thus such 
20 oligonucleotides can be used to determine the presence or absence of particular alleles of the 
polymorphic sequences described herein. These oligonucleotides can be probes or primers. 

Not only does the present invention provide polymorphisms in linkage disequilibrium 
with the polymorphisms of Tables 1 , 4, and 6, it also provides methods for revealing the 
existence of yet other polymorphic regions in the human THBS2, ACE, or FGB gene. For 
25 example, the polymorphism studies described herein can also be applied to populations in 
which other vascular diseases or disorders are prevalent. 

Other aspects of the invention are described below or will be apparent to one of skill 
in the art in light of the present disclosure. 
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Definitions 

For convenience, the meaning of certain terms and phrases employed in the 
specification, examples, and appended claims are provided below. 

The term "allele", which is used interchangeably herein with "allelic variant" refers to 
5 alternative forms of a gene or portions thereof Alleles occupy the same locus or position on 
homologous chromosomes. When a subject has two identical alleles of a gene, the subject is 
said to be homozygous for the gene or allele. When a subject has two different alleles of a 
gene, the subject is said to be heterozygous for the gene or allele. Alleles of a specific gene, 
including the THBS2, ACE, or FGB genes, can differ from each other in a single nucleotide, 

10 or several nucleotides, and can include substitutions, deletions, and insertions of nucleotides. 
An allele of a gene can also be a form of a gene containing one or more mutations. 

The term "allelic variant of a polymorphic region of a THBS2, ACE, or FGB gene" 
refers to an alternative form of the THBS2, ACE, or FGB gene having one of several 
possible nucleotide sequences found in that region of the gene in the population. 

15 "Biological activity'' or "bioactivity" or "activity" or "biological function", which are 

used interchangeably, for the purposes herein when applied to THBS2, ACE, or FGB, means 
an effector or antigenic function that is directly or indirectly performed by a THBS2, ACE, 
or FGB polypeptide (whether in its native or denatxired conformation), or by a fragment 
thereof. Biological activities include modulation of the development of atherosclerotic 

20 plaque leading to vascular disease and other biological activities, whether presently known or 
inherent. A THBS2, ACE, or FGB bioactivity can be modulated by directly affecting a 
THBS2, ACE, or FGB protein effected by, for example, changing the level of effector or 
substrate level Alternatively, a THBS2, ACE, or FGB bioactivity can be modulated by 
modulating the level of a THBS2, ACE, or FGB protein, such as by modulating expression 

25 of a THBS2, ACE, or FGB gene. Antigenic functions include possession of an epitope or 
antigenic site that is capable of cross-reacting with antibodies that bind a native or denatured 
THBS2, ACE, or FGB polypeptide or fragment thereof. 

Biologically active THBS2, ACE, or FGB polypeptides include polypeptides having 
both an effector and antigenic function, or only one of such functions. THBS2, ACE, or 
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FGB polypeptides include antagonist polypeptides and native THBS2, ACE, or FGB 
polypeptides, provided that such antagonists include an epitope of a native THBS2, ACE, or 
FGB polypeptide. An effector function of THBS2, ACE, or FGB polypeptide can be the 
ability to bind to a ligand of a THBS2, ACE, or FGB molecule. 
5 As used herein the term "bioactive fragment of a THBS2, ACE, or FGB protein" 

refers to a fragment of a full-length THBS2, ACE, or FGB protein, wherein the fragment 
specifically mimics or antagonizes the activity of a wild-type THBS2, ACE, or FGB protein. 
The bioactive jfragment preferably is a fragment capable of binding to a second molecule, 
such as a ligand. 

1 0 The term "an aberrant activity" or "abnormal activity", as applied to an activity of a 

Q protein such as THBS2, ACE, or FGB, refers to an activity which differs from the activity of 

S;j the wild-type (i.e., normal or reference) protein or which differs from the activity of the 

^■u protein in a healthy subject, e.g., a subject not afflicted with a disease associated with a 

i ^ 

THBS2, ACE, or FGB allelic variant. An activity of a protein can be aberrant because it is 

I' -fc 15 stronger than the activity of its wild-type counterpart. Alternatively, an activity of a protein 

riJ 

i2 can be aberrant because it is weaker or absent relative to the activity of its wild-type 

counterpart. An aberrant activity can also be a change in reactivity. For example an aberrant 

M protein can interact with a different protein or ligand relative to its wild-type counterpart. A 

cell can also have aberrant THBS2, ACE, or FGB activity due to overexpression or 
20 underexpression of the THBS2, ACE, or FGB gene. Aberrant THBS2, ACE, or FGB 

activity can result from a mutation in the gene, which results, e,g,y in lower or higher binding 
affinity of a ligand to the THBS2, ACE, or FGB protein encoded by the mutated gene. 
Aberrant THBS2, ACE, or FGB activity can also result from an abnormal THBS2, ACE, or 
FGB 5' upstream regulatory element activity. 
25 "Cells," "host cells" or "recombinant host cells" are terms used interchangeably 

herein. It is understood that such terms refer not only to the particular cell but to the 
progeny or derivatives of such a cell. Because certain modifications may occur in 
succeeding generations due to either mutation or environmental influences, such progeny 
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may not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 

As used herein, the term "course of clinical therapy" refers to any chosen method to 
treat, prevent, or ameliorate a vascular disease, e,g., CAD or MI, symptoms thereof, or 
related diseases or disorders. Courses of clinical therapy include, but are not limited to, 
lifestyle changes (e.g,, changes in diet or environment), administration of medication, use of 
medical devices, such as, but not limited to, stents, angioplasty devices, defibrillators, 
pacemakers, and surgical procedures, such as, for example, percutaneous translxmiinal 
coronary balloon angioplasty (PTCA) or laser angioplasty, defibrillators, implantation of a 
stent, or other surgical intervention, such as, for example, coronary bypass grafting (CABG), 
or any combination thereof. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
molecule comprising an open reading frame and including at least one exon and (optionally) 
an intron sequence. The term "intron" refers to a DNA sequence present in a given gene 
which is spliced out during mRNA maturation. 

As used herein, the term "genetic profile" refers to the information obtained from 
identification of the specific alleles of a subject, e.g., specific alleles within a polymorphic 
region of a particular gene or genes or proteins encoded by such genes. For example, a 
THBS genetic profile refers to the specific alleles of a subject within the THBS2 gene, an 
ACE genetic profile refers to the specific alleles of a subject within the ACE gene, and a 
FGB genetic profile refers to the specific alleles of a subject within the FGB gene. For 
example, one can determine a subject's THBS2, ACE, and/or FGB genetic profile by 
determining the identity of the nucleotide present at nucleotide position 3949 and/or 
nucleotide position 4476 of SEQ ID NO:l, and/or the nucleotide present at nucleotide 
position 86408 of SEQ ID N0:3, and/or the nucleotide present at nucleotide position 5119 
and/or nucleotide position 8059 of SEQ ID NO:5. One can also determine a subject's FGB 
genetic profile by determining the identity of the amino acid present at amino acid residue 
478 of SEQ ID NO:6. The genetic profile of a particular disease can be ascertained through 
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identification of the identity of allelic variants in one or more genes which are associated 
with the particular disease. 

"Homology" or "identity" or "similarity" refers to sequence similarity between two 
peptides or between two nucleic acid molecules. Homology can be determined by 
5 comparing a position in each sequence which may be aligned for purposes of comparison. 
When a position in the compared sequence is occupied by the same base or amino acid, then 
the molecules are homologous at that position. A degree of homology between sequences is 
a function of the number of matching or homologous positions shared by the sequences. An 
"unrelated" or "non-homologous" sequence shares less than 40 % identity, though preferably 
10 less than 25 % identity, with one of the sequences of the present invention. 

i. -J 

Q To determine the percent identity of two amino acid sequences or of two nucleic 

; 

Si acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 

introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). The amino acid residues or 

1'=^ 15 nucleotides at corresponding amino acid positions or nucleotide positions are then compared. 

u2 When a position in the first sequence is occupied by the same amino acid residue or 

nucleotide as the corresponding position in the second sequence, then the molecules are 

' ==^ identical at that position. The percent identity between the two sequences is a fimction of the 

number of identical positions shared by the sequences (Le.^ % identity = number of identical 
20 positions/total number of positions (e.g., overlapping positions) xlOO). In one embodiment 
the two sequences are the same length. 

The determination of percent identity between two sequences can be accomplished 
using a mathematical algorithm. A preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of two sequences is the algorithm of Karlin and 
25 Altschul (1990) Proc. Natl. Acad ScL USA 87:2264-2268, modified as in Karlin and 
Altschul (1993) Proc. Natl Acad ScL USA 90:5873-5877. Such an algorithm is 
incorporated into the NBLAST and XBLAST programs of Altschul, et al (1990) J, Mol 
Biol 215:403-410. BLAST nucleotide searches can be performed witii the NBLAST 
program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to a 
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nucleic acid molecules of the invention. BLAST protein searches can be performed with the 
XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences homologous 
to a protein molecules of the invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in Altschul et al (1997) Nucleic Acids 
5 Res, 25:3389-3402. Alternatively, PSI-Blast can be used to perform an iterated search which 
detects distant relationships between molecules. When utilizing BLAST, Gapped BLAST, 
and PSI-Blast programs, the default parameters of the respective programs {e.g., XBLAST 
and NBLAST) can be used. Another preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, 

10 (1988) CABIOS 4:1 1-17. Such an algorithm is incorporated into the ALIGN program 
(version 2.0) which is part of the GCG sequence alignment software package. When 
utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. Yet another 
useful algorithm for identifying regions of local sequence similarity and alignment is the 

15 FASTA algorithm as described in Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 
85:2444-2448. When using the FASTA algorithm for comparing nucleotide or amino acid 
sequences, a PAM120 weight residue table can, for example, be used with a Muple value of 
2. 

The term "a homolog of a nucleic acid" refers to a nucleic acid having a nucleotide 
20 sequence having a certain degree of homology with the nucleotide sequence of the nucleic 
acid or complement thereof. For example, a homolog of a double stranded nucleic acid 
having SEQ ID NO:N is intended to include nucleic acids having a nucleotide sequence 
which has a certain degree of homology with SEQ ID NO:N or with the complement thereof. 
Preferred homologs of nucleic acids are capable of hybridizing to the nucleic acid or 
25 complement thereof. 

The term "hybridization probe" or "primer" as used herein is intended to include 
oligonucleotides which hybridize bind in a base-specific manner to a complementary strand 
of a target nucleic acid. Such probes include peptide nucleic acids, and described in Nielsen 
et al, (1991) Science 254:1497-1500. Probes and primers can be any length suitable for 
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specific hybridization to the target nucleic acid sequence. The most appropriate length of the 
probe and primer may vary depending on the hybridization method in which it is being used; 
for example, particular lengths may be more appropriate for use in microfabricated arrays, 
while other lengths may be more suitable for use in classical hybridization methods. Such 
5 optimizations are known to the skilled artisan. Suitable probes and primers can range form 
about 5 nucleotides to about 30 nucleotides in length. For example, probes and primers can 
be 5, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 25, 26, 28 or 30 nucleotides in length. The probe or 
primer of the invention comprises a sequence that flanks and/or preferably overlaps, at least 
one polymorphic site occupied by any of the possible variant nucleotides. The nucleotide 

1 0 sequence of an overlapping probe or primer can correspond to the coding sequence of the 
allele or to the complement of the coding sequence of the allele. 

The term "vascular disease or disorder" as used herein refers to any disease or 
disorder effecting the vascular system, including the heart and blood vessels. A vascular 
disease or disorder includes any disease or disorder characterized by vascular dysfunction, 

15 including, for example, intravascular stenosis (narrowing) or occlusion (blockage), due to the 
development of atherosclerotic plaque and diseases and disorders resulting therefrom. 
Examples of vascular diseases and disorders include, without limitation, atherosclerosis, 
CAD, MI, ischemia, stroke, peripheral vascular diseases, venous thromboembolism and 
pulmonary embolism. 

20 The term "interact" as used herein is meant to include detectable interactions between 

molecules, such as can be detected using, for example, a binding or hybridization assay. The 
term interact is also meant to include "binding" interactions between molecules. Interactions 
may be, for example, protein-protein, protein-nucleic acid, protein-small molecule or small 
molecule-nucleic acid in nature. 

25 The term ''intronic sequence" or "intronic nucleotide sequence" refers to the 

nucleotide sequence of an intron or portion thereof . 

The term "isolated" as used herein with respect to nucleic acids, such as DNA or 
RNA, refers to molecules separated from other DNAs or RNAs, respectively, that are present 
in the natural source of the macromolecule. The term isolated as used herein also refers to a 
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nucleic acid or peptide that is substantially free of cellular material, viral material, or culture 
medium when produced by recombinant DNA techniques, or chemical precursors or other 
chemicals when chemically synthesized. Moreover, an "isolated nucleic acid" is meant to 
include nucleic acid fragments which are not naturally occurring as fragments and would not 
5 be found in the natural state. The term "isolated" is also used herein to refer to polypeptides 
which are isolated from other cellular proteins and is meant to encompass both purified and 
recombinant polypeptides. 

The term "linkage" describes the tendency of genes, alleles, loci or genetic markers to 
be inherited together as a result of their location on the same chromosome. It can be 

1 0 measured by percent recombination between the two genes, alleles, loci, or genetic markers. 
The term "linkage disequilibrium" refers to a greater than random association between 
specific alleles at two marker loci within a particular population. In general, linkage 
disequilibrium decreases with an increase in physical distance. If linkage disequilibrium 
exists between two markers, then the genotypic information at one marker can be used to 

15 make probabilistic predictions about the genotype of the second marker. 

The term "locus" refers to a specific position in a chromosome. For example, a locus 
of a THBS2, ACE, or FGB gene refers to the chromosomal position of the THBS2, ACE, or 
FGB gene. 

The term "modulation" as used herein refers to both upregulation, (i.e., activation or 
20 stimulation), for example by agonizing; and downregulation (i.e. inhibition or suppression), 
for example by antagonizing of a bioactivity (e.g. expression of a gene). 

The term "molecular structure" of a gene or a portion thereof refers to the structure as 
defined by the nucleotide content (including deletions, substitutions, additions of one or 
more nucleotides), the nucleotide sequence, the state of methylation, and/or any other 
25 modification of the gene or portion thereof. 

The term "mutated gene" refers to an allelic form of a gene that differs from the 
predominant form in a population. A mutated gene is capable of altering the phenotype of a 
subject having the mutated gene relative to a subject having the predominant form of the 
gene. If a subject must be homozygous for this mutation to have an altered phenotype, the 
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mutation is said to be recessive. If one copy of the mutated gene is sufficient to alter the 
phenotype of the subject, the mutation is said to be dominant. If a subject has one copy of 
the mutated gene and has a phenotype that is intermediate between that of a homozygous and 
that of a heterozygous subject (for that gene), the mutation is said to be co-dominant. 

As used herein^ the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term 
should also be understood to include, as equivalents, derivatives, variants and analogs of 
either RNA or DNA made from nucleotide analogs, and, as applicable to the embodiment 
being described, single (sense or antisense) and double-stranded polynucleotides. 
Deoxyribonucleotides include deoxyadenosine, deoxycytidine, deoxyguanosine, and 
deoxythymidine. For purposes of clarity, when referring herein to a nucleotide of a nucleic 
acid, which can be DNA or an RNA, the terms "adenine", "cytidine", "guanine", and 
thymidine" and/or "A", "C", "G", and "T", respectively, are used. It is understood that if the 
nucleic acid is RNA, a nucleotide having a uracil base is uridine. 

The term "nucleotide sequence complementary to the nucleotide sequence set forth 
in SEQ ID NO:N" refers to the nucleotide sequence of the complementary strand of a nucleic 
acid strand having SEQ ID NO:N. The term "complementary strand" is used herein 
interchangeably with the term "complement". The complement of a nucleic acid strand can 
be the complement of a coding strand or Ihe complement of a non-coding strand. When 
referrhig to double stranded nucleic acids, the complement of a nucleic acid having SEQ ID 
NO:N refers to the complementary strand of the strand having SEQ ID NO:N or to any 
nucleic acid having the nucleotide sequence of the complementary strand of SEQ ID NO:N. 
When referring to a single stranded nucleic acid having the nucleotide sequence SEQ ID 
NO:N, the complement of this nucleic acid is a nucleic acid having a nucleotide sequence 
which is complementary to that of SEQ ID NO:N. The nucleotide sequences and 
complementary sequences thereof are always given in the 5' to 3' direction. The term 
"complement" and "reverse complement" are used interchangeably herein. 

A "non-human animal" of the invention can include mammals such as rodents, 
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non-hximan primates, sheep, goats, horses, dogs, cows, chickens, amphibians, reptiles, etc. 
Preferred non-human animals are selected from the rodent family including rat and mouse, 
most preferably mouse, though transgenic amphibians, such as members of the Xenopus 
genus, and transgenic chickens can also provide important tools for understanding and 
5 identifying agents which can affect, for example, embryogenesis and tissue formation. The 
term "chimeric animal" is used herein to refer to animals in which an exogenous sequence is 
foimd, or in which an exogenous sequence is expressed in some but not all cells of the 
animal. The term "tissue-specific chimeric animal" indicates that an exogenous sequence is 
present and/or expressed or disrupted in some tissues, but not others. 
10 The term "oligonucleotide" is intended to include and single- or double stranded 

.1 as, 

DNA or RNA. Oligonucleotides can be naturally occurring or synthetic, but are typically 

'J prepared by synthetic means. Preferred oligonucleotides of the invention include segments 

"'4 

m of THBS2, ACE, or FGB gene sequence or their complements, which include and/or flank 

any one of the polymoiphic sites shown in Tables 1 , 4, and 6. The segments can be between 
15 5 and 250 bases, and, in specific embodiments, are between 5-10, 5-20, 10-20, 10-50, 20-50 

or 10-100 bases. For example, the segments can be 21 bases. The polymorphic site can 
i'^ occur within any position of the segment or a region next to the segment. The segments can 

be from any of the allelic forms of THBS2, ACE, or FGB gene sequence shown in Tables 1, 

4, and 6. 

20 The term "operably-linked" is intended to mean that the 5' upstream regulatory 

element is associated with a nucleic acid in such a manner as to facilitate transcription of the 
nucleic acid from the 5' upstream regulatory element. 

The term "polymorphism" refers to the coexistence of more than one form of a gene 
or portion thereof. A portion of a gene of which there are at least two different forms, i.e., 

25 two different nucleotide sequences, is referred to as a "polymorphic region of a gene." A 
polymorphic locus can be a single nucleotide, the identity of which differs in the other 
alleles. A polymorphic locus can also be more than one nucleotide long. The allelic form 
occurring most frequently in a selected population is often referred to as the reference and/or 
wildtype form. Other allelic forms are typically designated or alternative or variant alleles. 
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Diploid organisms may be homozygous or heterozygous for allelic forms. A diallelic or 
biallelic polymorphism has two forms. A trialleleic polymorphism has three forms. 

A "polymorphic gene" refers to a gene having at least one polymorphic region. 

The term "primer" as used herein, refers to a single-stranded oligonucleotide which 
5 acts as a point of initiation of template-directed DNA synthesis under appropriate conditions 
{e.g., in the presence of four different nucleoside triphosphates and as agent for 
polymerization, such as DNA or RNA polymerase or reverse transcriptase) in an appropriate 
buffer and at a suitable temperature. The length of a primer may vary but typically ranges 
from 15 to 30 nucleotides. A primer need not match the exact sequence of a template, but 
10 must be sufficiently complementary to hybridize with the template. 

The term "primer pair" refers to a set of primers including an upstream primer that 
hybridizes with the 3' end of the complement of the DNA sequence to be amplified and a 
downstream primer that hybridizes with the 3' end of the sequence to be amplified. 

The terms "protein", "polypeptide" and "peptide" are used interchangeably herein 
1 5 when referring to a gene product. 

The term "recombinant protein" refers to a polypeptide which is produced by 
recombinant DNA techniques, wherein generally, DNA encoding the polypeptide is inserted 
into a suitable expression vector which is in turn used to transform a host cell to produce the 
heterologous protein. 

20 A "regulatory element", also termed herein "regulatory sequence" is intended to 

include elements which are capable of modulating transcription from a 5' upstream 
regulatory sequence, including, but not limited to a basic promoter, and include elements 
such as enhancers and silencers. The term "enhancer", also referred to herein as "enhancer 
element", is intended to include regulatory elements capable of increasing, stimulating, or 

25 enhancing transcription from a 5' upstream regulatory element, including a basic promoter. 
The term "silencer", also referred to herein as "silencer element" is intended to include 
regulatory elements capable of decreasing, inhibiting, or repressing transcription from a 5' 
upstream regulatory element, including a basic promoter. Regulatory elements are typically 
present in 5' flanking regions of genes. Regulatory elements also may be present in other 
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regions of a gene, such as introns. Thus, it is possible that THBSl, ACE, or FGB genes have 
regulatory elements located in introns, exons, coding regions, and 3' flanking sequences. 
Such regulatory elements are also intended to be encompassed by the present invention and 
can be identified by any of the assays that can be used to identify regulatory elements in 5' 
flanking regions of genes. 

The term ''regulatory element" further encompasses "tissue specific" regulatory 
elements, i.e., regulatory elements which effect expression of an operably linked DNA 
sequence preferentially in specific cells (e,g., cells of a specific tissue). Gene expression 
occurs preferentially in a specific cell if expression in this cell type is significantly higher 
than expression in other cell types. The term "regulatory element" also encompasses non- 
tissue specific regulatory elements, i.e., regulatory elements which are active in most cell 
types. Furthermore, a regulatory element can be a constitutive regulatory element, i.e., a 
regulatory element which constitutively regulates transcription, as opposed to a regulatory 
element which is inducible, i.e., a regulatory element which is active primarily in response to 
a stimulus. A stimulus can be, e.g., a molecule, such as a protein, hormone, cytokine, heavy 
metal, phorbol ester, cyclic AMP (cAMP), or retinoic acid. 

Regulatory elements are typically bound by proteins, e.g., transcription factors. The 
term "transcription factor" is intended to include proteins or modified forms thereof, which 
interact preferentially with specific nucleic acid sequences, i.e., regulatory elements, and 
which in appropriate conditions stimulate or repress transcription. Some transcription factors 
are active when they are in the form of a monomer. Alternatively, other transcription factors 
are active in the form of a dimer consisting of two identical proteins or different proteins 
(heterodimer). Modified forms of transcription factors are intended to refer to transcription 
factors having a postranslational modification, such as the attachment of a phosphate group. 
The activity of a transcription factor is frequently modulated by a postranslational 
modification. For example, certain transcription factors are active only if they are 
phosphorylated on specific residues. Alternatively, transcription factors can be active in the 
absence of phosphorylated residues and become inactivated by phosphorylation. A list of 
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known transcription factors and their DNA binding site can be found, e.g., in public 
databases, e.g., TFMATRIX Transcription Factor Binding Site Profile database. 

The term "single nucleotide polymorphism" (SNP) refers to a polymorphic site 
occupied by a single nucleotide, which is the site of variation between allelic sequences. The 
site is usually preceded by and followed by highly conserved sequences of the allele (e.g., 
sequences that vary in less than 1/100 or 1/1000 members of a population). A SNP usually 
arises due to substitution of one nucleotide for another at the polymorphic site. SNPs can 
also arise from a deletion of a nucleotide or an insertion of a nucleotide relative to a reference 
allele. Typically the polymorphic site is occupied by a base other than the reference base. 
For example, where the reference allele contains the base "T" (thymidine) at the polymorphic 
site, the altered allele can contain a "C" (cytidine), "G" (guanine), or "A" (adenine) at the 
polymorphic site. 

SNP's may occur in protein-coding nucleic acid sequences, in which case they may 
give rise to a defective or otherwise variant protein, or genetic disease. Such a SNP may 
alter the coding sequence of the gene and therefore specify another amino acid (a "missense" 
SNP) or a SNP may introduce a stop codon (a "nonsense" SNP). When a SNP does not alter 
the amino acid sequence of a protein, the SNP is called "silent." SNP's may also occur in 
noncoding regions of the nucleotide sequence. This may result in defective protein 
expression, e.g., as a result of altemative spicing, or it may have no effect 

As used herein, the term "specifically hybridizes" or "specifically detects" refers to 
the ability of a nucleic acid molecule of the invention to hybridize to at least approximately 
6, 8, 10, 12, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130 or 140 consecutive nucleotides 
of either strand of a THBS2, ACE, or FGB gene. 

As used herein, the term "transfection" means the introduction of a nucleic acid, e.g., 
an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. The term 
"transduction" is generally used herein when the transfection with a nucleic acid is by viral 
delivery of the nucleic acid. "Transformation", as used herein, refers to a process in which a 
cell's genotype is changed as a result of the cellxilar uptake of exogenous DNA or RNA, and, 
for example, the transformed cell expresses a recombinant form of a polypeptide or, in the 
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case of anti-sense expression from the transferred gene, the expression of a naturally- 
occurring form of the recombinant protein is disrupted. 

As used herein, the term "transgene" refers to a nucleic acid sequence which has been 
genetic-engineered into a cell. Daughter cells deriving from a cell in which a transgene has 
5 been introduced are also said to contain the transgene (unless it has been deleted). A 
transgene can encode, e.g., a polypeptide, or an antisense transcript, partly or entirely 
heterologous, z.e., foreign, to the transgenic animal or cell into which it is introduced, or, is 
homologous to an endogenous gene of the transgenic animal or cell into which it is 
introduced, but which is designed to be inserted, or is inserted, into the animars genome in 

10 such a way as to alter the genome of the cell into which it is inserted {e.g., it is inserted at a 
location which differs from that of the natural gene or its insertion results in a knockout). 
Alternatively, a transgene can also be present in an episome. A transgene can include one or 
more transcriptional regulatory sequence and any other nucleic acid, (e.g. intron), that may 
be necessary for optimal expression of a selected nucleic acid. 

15 A "transgenic animal" refers to any animal, preferably a non-human animal, e.g. a 

mammal, bird or an amphibian, in which one or more of the cells of the animal contain 
heterologous nucleic acid introduced by genetic engineering, such as by transgenic 
techniques well known in the art. The nucleic acid is introduced into the cell, directly or 
indirectly by introduction into a precursor of the cell, by way of deliberate genetic 

20 manipulation, such as by microinjection or by infection with a recombinant virus. The term 
genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 
rather is directed to the introduction of a recombinant DNA molecule. This molecule may be 
integrated within a chromosome, or it may be extrachromosomally replicating DNA. In the 
typical transgenic animals described herein, the transgene causes cells to express a 

25 recombinant form of one of a protein, e.g. either agonistic or antagonistic forms. However, 
transgenic animals in which the recombinant gene is silent are also contemplated, as for 
example, the FLP or CRE recombinase dependent constructs described below. Moreover, 
"transgenic animal" also includes those recombinant animals in which gene disruption of one 
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or more genes is caused by human intervention, including both recombination and antisense 
techniques. 

The term "treatment", or "treating" as used herein, is defined as the apphcation or 
administration of a therapeutic agent to a subject, implementation of lifestyle changes (e.g., 
changes in diet or environment), administration of medication, use of medical devices, such 
as, but not limited to, stents, angioplasty devices, defibrillators, and surgical procedures, such 
as, for example, percutaneous transluminal coronary balloon angioplasty (PTCA) or laser 
angioplasty, implantation of a stent, or other surgical intervention, such as, for example, 
coronary bypass grafting (CABG), or any combination thereof, or application or 
administration of a therapeutic agent to an isolated tissue or cell line from a subject, v^ho has 
a disease or disorder, a symptom of disease or disorder or a predisposition toward a disease 
or disorder, with the purpose to cure, heal, alleviate, relieve, alter, remedy, ameliorate, 
improve or affect the disease or disorder, the symptoms of the disease or disorder, or the 
predisposition toward disease. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting or replicating another nucleic acid to which it has been linked. One type of 
preferred vector is an episome, he., a nucleic acid capable of extra-chromosomal replication. 
Preferred vectors are those capable of autonomous replication and/or expression of nucleic 
acids to which they are linked. Vectors capable of directing the expression of genes to which 
they are operatively-linked are referred to herein as "expression vectors". In general, 
expression vectors of utility in recombinant DNA techniques are often in the form of 
"plasmids" which refer generally to circular double stranded DNA circles which, in their 
vector form are not physically linked to the host chromosome. In the present specification, 
"plasmid" and "vector" are used interchangeably as the plasmid is the most commonly used 
form of vector. However, the invention is intended to include such other forms of expression 
vectors which serve equivalent functions and which become known in the art subsequently 
hereto. 
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Polymorphisms Used in the Methods of the Invention 

The nucleic acid molecules of the present invention include specific allelic variants of 
the THBS2, ACE, and FGB genes, which differ from the reference sequences set forth in 
SEQ ID NO:l, SEQ ID N0:3, and SEQ ID N0:5, respectively, or at least a portion thereof, 
having a polymorphic region. The preferred nucleic acid molecules of the present invention 
comprise THBS2, ACE, and FGB sequences having one or more of the polymorphisms 
shown in Tables 1, 4, and 6 (SEQ ID NOs.:7, 8, 9, 10, and 1 1), and those in linkage 
disequilibrium therewith. The invention further comprises isolated nucleic acid molecules 
complementary to nucleic acid molecules comprising the polymorphisms of the present 
invention. Nucleic acid molecules of the present invention can function as probes or primers, 
e,g„ in methods for determining the allelic identity of a THBS2, ACE, or FGB polymorphic 
region. The nucleic acids of the invention can also be used, singly, or in combination, to 
determine whether a subject is or is not at risk of developing a disease associated with a 
specific allelic variant of a THBS2, ACE, or FGB polymorphic region, e.g,, a vascular 
disease or disorder. The nucleic acids of the invention can further be used to prepare or 
express THBS2, ACE, or FGB polypeptides encoded by specific alleles, such as mutant 
alleles. Such nucleic acids can be used in gene therapy. Polypeptides encoded by specific 
THBS2, ACE, or FGB alleles, such as mutant THBS2, ACE, or FGB polj^eptideSj, can also 
be used in therapy or for preparing reagents, e.g,, antibodies, for detecting THBS2, ACE, or 
FGB proteins encoded by these alleles. Accordingly, such reagents can be used to detect 
mutant THBS2, ACE, or FGB proteins. 

As described herein, allelic variants of human THBS2, ACE, or FGB genes have 
been identified. The invention is intended to encompass these allelic variants as well as, 
those in linkage disequilibrium which can be identified, e.g., according to the methods 
described herein. "Linkage disequilibrium" refers to an association between specific alleles 
at two marker loci within a particular population. In general, linkage disequilbrium 
decreases with an increase in physical distance. If linkage disequilbrium exists between two 
markers, then the genotypic information at one marker can be used to make predictions about 
the genotype of the second marker. 
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The invention also provides isolated nucleic acids comprising at least one 
polymorphic region of a THBS2, ACE, or FGB gene having a nucleotide sequence which 
differs from the reference nucleotide sequence set forth in SEQ ID NO:l, SEQ ID N0:3, or 
SEQ ID N0:5 respectively. Preferred nucleic acids have a variant allele located in the 
5 coding region of a THBS2, ACE, or FGB gene, the upstream regulatory element, an axon, or 
in the 3' UTR of a THBS2, ACE, or FGB gene. Accordingly, preferred nucleic acids of the 
invention comprise a guanine at residue 3949 of GI 307505, and/or a cytidine at residue 4476 
of GI 307505 (as set forth in SEQ ID N0:1), or the complement thereof, and/or a guajiine at 
residue 455299 of GI 13027555 (as set forth in SEQ ID NO:3), or the complement thereof, 
10 and/or a thymidine at residue 5 1 19 of GI 1 82597, and/or an adenine at residue 8059 of GI 
j 1 82597 (set forth herein as SEQ ID NO:5). Preferred nucleic acids used in combination in 

%l the methods of the invention to predict decreased risk of vascular diseases or disorders 

"'4 

i ij comprise "pattern 1 which comprises two copies of the variant allele of G5755e9 (CC) in 

combination with two copies of the reference allele of G5755e5 (TT) or ''pattern 2", which 
|- ;^ 15 comprises two copies of the reference allele of G5755e9 (TT) and two copies of the variant 

1 s 

! -W 

j;^ allele of G5755e5 (GG) is at approximately 3-fold decreased odds of vascular disease. 

'•Z Other preferred nucleic acids used in the methods of the invention to predict 

decreased risk of vascular diseases or disorders comprise one copy of an A and one copy of a 
G at nucleotide residue 86408 of the ACE reference sequence GI 13027555 (AG genotype) is 
20 at a decreased risk for vascular disease. 

Still other preferred nucleic acids used in the methods of the invention to predict 
decreased risk of vascular diseases or disorders comprise two copies of a T at nucleotide 
residue 5 1 1 9 of the FGB reference sequence GI 1 82597 is at a decreased risk for vascular 
disease, e.g., CAD and MI. A subject having one copy of a T and one copy of a C at 
25 nucleotide residue 51 19 of the FGB reference sequence GI 182597 is also at a decreased risk 
for vascular disease, e.g., CAD and MI. 

Other preferred nucleic acids used in the methods of the invention to predict 
decreased risk of vascular diseases or disorders comprise two copies of an A at nucleotide 
residue 8059 of the FGB reference sequence GI 1 82597 is at a decreased risk for vascular 
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disease. A subject having one copy of an A and one copy of a G at nucleotide residue 5119 
of the FGB reference sequence GI 182597 is also at a decreased risk for vascular disease (see 
Example 1, below). 

The nucleic acid molecules of the present invention can be single stranded DNA {e,g., 
5 an oligonucleotide), double stranded DNA {e.g., double stranded oligonucleotide) or RNA. 
Preferred nucleic acid molecules of the invention can be used as probes or primers. Primers 
of the invention refer to nucleic acids which hybridize to a nucleic acid sequence which is 
adjacent to the region of interest or which covers the region of interest and is extended. As 
used herein, the term "hybridizes" is intended to describe conditions for hybridization and 
% 10 washing under which nucleotide sequences that are significantly identical or homologous to 
J f each other remain hybridized to each other. Preferably, the conditions are such that 

Sf sequences at least about 70%, more preferably at least about 80%, even more preferably at 

m least about 85% or 90% identical to each other remain hybridized to each other. Such 

''''^ stringent conditions vary according to the length of the involved nucleotide sequence but are 

1;;;^ 15 known to those skilled in the art and can be found or determined based on teachings in 
\:A Current Protocols in Molecular Biology^ Ausubel et al, eds., John Wiley & Sons, Inc. 

(1995), sections 2, 4 and 6. Additional stringent conditions and formulas for determining 
such conditions can be found in Molecular Cloning: A Laboratory Manual, Sambrook et al , 
Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), chapters 7, 9 and 1 1 . A 
20 preferred, non-limiting example of stringent hybridization conditions for hybrids that are at 
least basepairs in length includes hybridization in 4X sodiimi chloride/sodixmi citrate (SSC), 
at about 65-70'^C (or hybridization in 4X SSC plus 50% formamide at about 42-50°C) 
followed by one or more washes in IX SSC, at about 65-70^C. A preferred, non-limiting 
example of highly stringent hybridization conditions for such hybrids includes hybridization 
25 in IX SSC, at about 65-70^C (or hybridization in IX SSC plus 50% formamide at about 42- 
50*^C) followed by one or more washes in 0.3X SSC, at about 65-70°C. A preferred, non- 
limiting example of reduced stringency hybridization conditions for such hybrids includes 
hybridization in 4X SSC, at about 50-60°C (or alternatively hybridization in 6X SSC plus 
50% formamide at about 40-45°C) followed by one or more washes in 2X SSC, at about 50- 
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eO^^C. Ranges intermediate to the above-recited values, e.g., at 65-70°C or at 42-50°C are 
also intended to be encompassed by the present invention. SSPE (IxSSPE is 0.1 5M NaCl, 
lOmM NaH2P04, and 1.25mM EDTA, pH 7.4) can be substituted for SSC (IxSSC is 0.15M 
NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
5 performed for 15 minutes each after hybridization is complete. 

The hybridization temperature for hybrids anticipated to be less than 50 base pairs in 
length should be 5-1 O^C less than the melting temperature (T^) of the hybrid, where is 
determined according to the following equations. For hybrids less than 18 base pairs in 
length, TXQ = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 49 
S 10 base pairs in length, T^Q = 81.5 + 16.6(logio[Nar]) + 0.41(%G+C) - (600/N), where N is 

.'.•SSf 

I'f the number of bases in the hybrid, and [Na^] is the concentration of sodium ions in the 

'''"4 hybridization buffer ([Na^] for IxSSC = 0.165 M). It will also be recognized by the skilled 

fij practitioner that additional reagents may be added to hybridization and/or wash buffers to 

decrease non-specific hybridization of nucleic acid molecules to membranes, for example, 
15 nitrocellulose or nylon membranes, including but not limited to blocking agents (e.g., BSA 
=^ or salmon or herring sperm carrier DNA), detergents (e.g. , SDS), chelating agents (e.g. , 

1 EDTA), FicoU, PVP and the like. When using nylon membranes, in particular, an additional 

preferred, non-limiting example of stringent hybridization conditions is hybridization in 
0,25-0.5M NaH2P04, 7% SDS at about 65°C, followed by one or more washes at 0.02M 
20 NaH2P04, 1% SDS at 65°C, see e,g, Church and Gilbert (1984) Proc, Natl Acad Scl USA 
81:1991-1995, (or alternatively 0.2X SSC, 1% SDS). 

A primer or probe can be used alone in a detection method, or a primer can be used 
together with at least one other primer or probe in a detection method. Primers can also be 
used to amplify at least a portion of a nucleic acid. Probes of the invention refer to nucleic 
25 acids which hybridize to the region of interest and which are not further extended. For 

example, a probe is a nucleic acid which specifically hybridizes to a pol3anorphic region of a 
THBS2, ACE, or FGB gene, and which by hybridization or absence of hybridization to the 
DNA of a subject or the type of hybrid formed will be indicative of the identity of the allelic 
variant of the polymorphic region of the THBS2, ACE, or FGB gene. 
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Niimerous procedures for determining the nucleotide sequence of a nucleic acid 
molecule, or for determining the presence of mutations in nucleic acid molecules include a 
nucleic acid amplification step, which can be carried out by, e.g., polymerase chain reaction 
(PCR). Accordingly, in one embodiment, the invention provides primers for amplifying 
5 portions of a THBS2, ACE, or FGB gene, such as portions of exons and/or portions of 

introns. In a preferred embodiment, the exons and/or sequences adjacent to the exons of the 
human THBS2, ACE, or FGB gene will be amplilSed to, e,g,, detect which allelic variant, if 
any, of a polymorphic region is present in the THBS2, ACE, or FGB gene of a subject. 
Preferred primers comprise a nucleotide sequence complementary a specific allelic variant of 
10 a THBS2, ACE, or FGB polymorphic region and of sufficient length to selectively hybridize 
J'^^' with a THBS2, ACE, or FGB gene. In a preferred embodiment, the primer, e.g., a 

' J substantially purified oligonucleotide, comprises a region having a nucleotide sequence 

'•■4 

y which hybridizes under stringent conditions to about 6, 8, 10, or 12, preferably 25, 30, 40, 

50, or 75 consecutive nucleotides of a THBS2, ACE, or FGB gene. In an even more 
'fl 15 preferred embodiment, the primer is capable of hybridizing to a THBS2, ACE, or FGB 

^ nucleotide sequence, complements thereof, allelic variants thereof, or complements of allelic 

™ variants thereof. For example, primers comprising a nucleotide sequence of at least about 8, 

^ 10, 12, or 15 consecutive nucleotides, at least about 25 nucleotides or having from about 15 

to about 20 nucleotides set forth in any of SEQ ID N0s:7, 8, 9, 10, or 1 1, or complement 
20 thereof are provided by the invention. Primers having a sequence of more than about 25 

nucleotides axe also within the scope of the invention. Preferred primers of the invention are 
primers that can be used in PCR for amplifying each of the exons of a THBS2, ACE, or FGB 
gene. 

Primers can be complementary to nucleotide sequences located close to each other or 
25 further apart, depending on the use of the amplified DNA. For example, primers can be 

chosen such that they amplify DNA fragments of at least about 10 nucleotides or as much as 
several kilobases. Preferably, the primers of the invention will hybridize selectively to 
THBS2, ACE, or FGB nucleotide sequences located about 150 to about 350 nucleotides 
apart. 
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For amplifying at least a portion of a nucleic acid, a forward primer (i.e., 5' primer) 
and a reverse primer (i.e., 3' primer) will preferably be used. Forward and reverse primers 
hybridize to complementary strands of a double stranded nucleic acid, such that upon 
extension from each primer, a double stranded nucleic acid is amplified. A forward primer 
5 can be a primer having a nucleotide sequence or a portion of the nucleotide sequence shown 
in Tables 1, 4, and 6 (e.g., SEQ ID NOs.:7, 8, 9, 10, and 11). A reverse primer can be a 
primer having a nucleotide sequence or a portion of the nucleotide sequence that is 
complementary to a nucleotide sequence shown in Tables 1, 4, and 6 (e.g., SEQ ID NOs.:7, 
8, 9, 10, and 11). 

10 Yet other preferred primers of the invention are nucleic acids which are capable of 

selectively hybridizing to an allelic variant of a polymorphic region of a THBS2, ACE, or 
FGB gene. Thus, such primers can be specific for a THBS2, ACE, or FGB gene sequence, 
so long as they have a nucleotide sequence which is capable of hybridizing to a THBS2, 
ACE, or FGB gene. Preferred primers are capable of specifically hybridizing to any of the 

15 allelic variants listed in Tables 1, 4, and 6. Such primers can be used, e.g., in sequence 
specific oligonucleotide priming as described further herein. 

Other preferred primers used in the methods of the invention are nucleic acids which 
are capable of hybridizing to the reference sequence of a THBS2, ACE, or FGB gene, 
thereby detecting the presence of the reference allele of an allelic variant or the absence of a 

20 variant allele in the THBS2, ACE, or FGB genes and primers capable of hybridizing to the 
variant sequence of a THBS2, ACE, or FGB gene. Such primers can be used in combination, 
e.g., primers specific for the alleles of pattern 1 or pattern 2, as described herein. The 
sequences of primers specific for the reference sequences comprising the THBS2, ACE, or 
FGB genes will be readily apparent to one of skill in the art. 

25 The THBS2, ACE, or FGB nucleic acids of the invention can also be used as probes, 

e.g., in therapeutic and diagnostic assays. For instance, the present invention provides a 
probe comprising a substantially purified oligonucleotide, which oligonucleotide comprises a 
region having a nucleotide sequence that is capable of hybridizing specifically to a region of 
a THBS2, ACE, or FGB gene which is polymorphic (e.g., SEQ ID NOs.:7, 8, 9, 10, and 11, 
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or a portion thereof). In an even more preferred embodiment of the invention, the probes are 
capable of hybridizing specifically to one allelic variant of a THBS2, ACE, or FGB gene 
having a nucleotide sequence which differs from the nucleotide sequence set forth in SEQ ID 
NOs: 1, 3, or 5. Such probes can then be used to specifically detect which allelic variant of a 
5 polymorphic region of a THBS2, ACE, or FGB gene is present in a subject. The 

polymorphic region can be located in the 5' upstream regulatory element, exon, or intron 
sequences of a THBS2, ACE, or FGB gene. 

Particularly, preferred probes of the invention have a number of nucleotides 
sufficient to allow specific hybridization to the target nucleotide sequence. Where the target 

10 nucleotide sequence is present in a large fragment of DNA, such as a genomic DNA 

fragment of several tens or hundreds of kilobases, the size of the probe may have to be longer 
to provide sufficiently specific hybridization, as compared to a probe which is used to detect 
a target sequence which is present in a shorter fragment of DNA. For example, in some 
diagnostic methods, a portion of a THBS2, ACE, or FGB gene may first be amplified and 

15 thus isolated from the rest of the chromosomal DNA and then hybridized to a probe. In such 
a situation, a shorter probe will likely provide sufficient specificity of hybridization. For 
example, a probe having a nucleotide sequence of about 10 nucleotides may be sufficient. 

In preferred embodiments, the probe or primer fiirther comprises a label attached 
thereto, which, e.g., is capable of being detected, e.g. the label group is selected from 

20 amongst radioisotopes, fluorescent compoimds, enzymes, and enzyme co-factors. 

In a preferred embodiment of the invention, the isolated nucleic acid, which is used, 
e.g., as a probe or a primer, is modified, so as to be more stable than naturally occurring 
nucleotides. Exemplary nucleic acid molecules which are modified include 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. 

25 Patent Numbers 5,176,996; 5,264,564; and 5,256,775). 

The nucleic acids of the invention can also be modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule. The 
nucleic acids, e.g., probes or primers, may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo)^ or agents facilitating transport across the cell 
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membrane (see, e.g,, L&tsingcr et al, (1989) Proc, Natl Acad ScL U.S.A. 86:6553-6556; 
Lemaitre et al, (1987) Proc. Natl Acad. Set U.S.A, 84:648-652; PCX Publication No. 
W08 8/098 10, published December 15, 1988), hybridization-triggered cleavage agents. (See, 
e.g., Krol et al, (1988) BioTechniques 6:958-976) or intercalating agents (See, e.g., Zon, 
5 (1988) Pharm. Res. 5:539-549). To this end, the nucleic acid of the invention may be 

conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, 
transport agent, hybridization-triggered cleavage agent, etc. 

The isolated nucleic acid comprising a THBS2, ACE, or FGB intronic sequence may 
comprise at least one modified base moiety which is selected from the group including but 

10 not limited to 5-fluorouracil, 5-bromo\iracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 

xantine, 4-acetylcji:idine, 5-(carboxyhydroxymethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytidine, 5-methylcytidine, N6-adenine, 7- 

15 methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytidine, 5-methyl-2-thiouracil, 2-thioTiracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3- 

20 N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The isolated nucleic acid may also comprise at least one modified sugar moiety 
selected from the group including but not limited to arabinose, 2-fluoroarabinose, xylulose, 
and hexose. 

In yet another embodiment, the nucleic acid comprises at least one modified 
25 phosphate backbone selected from the group consisting of a phosphorothioate, a 

phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof. 

In yet a further embodiment, the nucleic acid is an a-anomeric oligonucleotide. An 
a-anomeric oligonucleotide forms specific double-stranded hybrids with complementary 
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RNA in which, contrary to the usual p-units, the strands run parallel to each other (Gautier et 
al, 1987, Nucl Acids Res. 15:6625-6641), The oligonucleotide is a 2'-0- 
methyhibonucleotide (Inoue et al, (1987) Nucl Acids Res. 15:6131-6148), or a chimeric 
RNA-DNA analogue (Inoue et al, (1987) FEES Lett, 215:327-330). 
5 Any nucleic acid fragment of the invention can be prepared according to methods 

well known in the art and described, e.g,, in Sambrook, J. Fritsch, E.F., and Maniatis, T. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N. Y. For example, discrete fragments of the DNA can be prepared and 
cloned using restriction enzymes. Alternatively, discrete fragments can be prepared using 

1 0 the Polymerase Chain Reaction (PCR) using primers having an appropriate sequence. 

Oligonucleotides of the invention may be synthesized by standard methods known in 
the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al ((1988) Nucl Acids Res, 16:3209), 

15 methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer 
supports (Sarin et al, (1988), Proc, Natl Acad Set U.S.A. 85:7448-7451), etc. 

The invention also provides vectors and plasmids comprising the nucleic acids of the 
invention. For example, in one embodiment, the invention provides a vector comprising at 
least a portion of the THBS2, ACE, or FOB gene comprising a polymorphic region. Thus, 

20 the invention provides vectors for expressing at least a portion of the newly identified allelic 
variants of the human THBS2, ACE, or FOB gene, as well as other allelic variants, 
comprising a nucleotide sequence which is different from the nucleotide sequence disclosed 
in GI 307505, GI 13027555, or GI 182597, respectively. The allelic variants can be 
expressed in eukaryotic cells, e.g., cells of a subject, or in prokaryotic cells. 

25 In one embodiment, the vector comprising at least a portion of a THBS2, ACE, or 

FGB allele is introduced into a host cell, such that a protein encoded by the allele is 
synthesized. The THBS2, ACE, or FGB protein produced can be used, e.g., for the 
production of antibodies, which can be used, e.g., in methods for detecting mutant forms of 
THBS2, ACE, or FGB. Altematively, the vector can be used for gene therapy, and be, e.g., 
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introduced into a subject to produce THBS2, ACE, or FGB protein. Host cells comprising a 
vector having at least a portion of a THBS2, ACE, or FGB gene are also within the scope of 
the invention. 

5 Polypeptides of the invention 

The present invention provides isolated THBS2, ACE, or FGB polypeptides, such as 
THBS2, ACE, or FGB polypeptides which are encoded by specific allelic variants of 
THBS2, ACE, or FGB, including those identified herein, e.g., proteins encoded by nucleic 
acids which differ jfrom the reference sequence of THBS2, ACE, or FGB, or a portion 

1 0 thereof, as set forth herein. The ammo acid sequences of the THBS2, ACE, or FGB proteins 
have been deduced. The THBS2 gene encodes a 1,172 amino acid protein and is described 
in, for example, LaBelle, et al (1993) Genomics 17(1):225. The ACE gene encodes a 1,306 
amino acid protein and is described in, for example, Rieder M.J. et al (1999) Nature 
Genetics (22)1 :59. The FGB gene encodes a 491 amino acid protein and is described in, for 

15 example, Chung, et al {\9^3)Ann. K Y. Acad Set 408, 449-456. 

As shown in Table 6, one polymorphism in the FGB gene found in the population 
screened results in a change in the amino acid sequence of the FGB protein. The FGBu4 
SNP is a change from a G to an A at nucleotide residue 8059 of the reference sequence GI 
182597, which results in a change from an arginine (R) to a lysine (K) at amino acid 478 of 

20 GI 11 76 1 63 1 , the reference sequence for the FGB protein. 

In one embodiment, the THBS2, ACE, or FGB polypeptides are isolated from, or 
otherwise substantially free of other cellular proteins. The term "substantially free of other 
cellular proteins" (also referred to herein as "contaminating proteins") or "substantially pure 
or purified preparations" are defined as encompassing preparations of THBS2, ACE, or FGB 

25 polypeptides having less than about 20% (by dry weight) contaminating protein, and 
preferably having less than about 5% contaminating protein. It will be appreciated that 
ftinctional forms of the subject polypeptides can be prepared, for the first time, as purified 
preparations by xising a cloned gene as described herein. 
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Preferred THBS2, ACE, or FGB proteins of the invention have an amino acid 
sequence which is at least about 60%, 70%, 80%, 85%, 90%, or 95% identical or 
homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, 
respectively. Even more preferred THBS2, ACE, or FGB proteins comprise an amino acid 
5 sequence which is at least about 95%, 96%, 97%, 98%, or 99% homologous or identical to 
the amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, respectively. 
Such proteins can be recombinant proteins, and can be, e.g., produced in vitro from nucleic 
acids comprising a specific allele of a THBS2, ACE, or FGB polymorphic region. For 
example, recombinant polypeptides preferred by the present invention can be encoded by a 

10 nucleic acid which comprises a sequence which is at least 85% homologous and more 
preferably 90% homologous and most preferably 95% homologous with a nucleotide 
sequence set forth in SEQ ID NOs:l, 3, or 5 and comprises an allele of a polymorphic region 
that differs from that set forth in SEQ ID NOs: 1, 3, or 5. Polypeptides which are encoded by 
a nucleic acid comprising a sequence that is at least about 98-99% homologous with the 

15 sequence of SEQ ID NOs: 1, 3, or 5 and comprises an allele of a polymorphic region that 
differs from that set forth in SEQ ID NOs: 1, 3, or 5 are also within the scope of the 
invention. 

In a preferred embodiment, a THBS2, ACE, or FGB protein of the present invention 
is a mammalian THBS2, ACE, or FGB protein. In an even more preferred embodiment, the 
20 THBS2, ACE, or FGB protein is a human protein. 

The invention also provides peptides that preferably are capable of functioning in one 
of either role of an agonist or antagonist of at least one biological activity of a reference 
("normal") THBS2, ACE, or FGB protein of the appended sequence listing. The term 
"evolutionarily related to," with respect to amino acid sequences of THBS2, ACE, or FGB 
25 proteins, refers to both polypeptides having amino acid sequences found in human 

populations, and also to artificially produced mutational variants of human THBS2, ACE, or 
FGB polypeptides which are derived, for example, by combinatorial mutagenesis. 

Full length proteins or fragments corresponding to one or more particular motifs 
and/or domains or to arbitrary sizes, for example, at least 5, 10, 25, 50, 75 and 100, amino 
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acids in length of THBS2, ACE, or FGB protein are within the scope of the present 
invention. 

Isolated THBS2, ACE, or FGB peptides or polypeptides can be obtained by screening 
peptides recombinantly produced from the corresponding fragment of the nucleic acid 
5 encoding such peptides. In addition, such peptides and polypeptides can be chemically 

synthesized using techniques known in the art such as conventional Merrifield solid phase f- 
Moc or t-Boc chemistry. For example, a THBS2, ACE, or FGB peptide or polypeptide of 
the present invention may be arbitrarily divided into fragments of desired length with no 
overlap of the fragments, or preferably divided into overlapping fragments of a desired 

1 0 length. The fragments can be produced (recombinantly or by chemical synthesis) and tested 
to identify those peptides or polypeptides which can fimction as either agonists or antagonists 
of a wild-type (e.g., "normal") THBS2, ACE, or FGB protein. 

In general, peptides and polypeptides referred to herein as having an activity (e.g., are 
"bioactive") of a THBS2, ACE, or FGB protein are defined as peptides and polypeptides 

15 which mimic or antagonize all or a portion of the biological/biochemical activities of a 
THBS2, ACE, or FGB protein having SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, 
respectively, such as the ability to bind ligands. Other biological activities of the subject 
THBS2, ACE, or FGB proteins are described herein or will be reasonably apparent to those 
skilled in the art. According to the present invention, a peptide or polypeptide has biological 

20 activity if it is a specific agonist or antagonist of a naturally-occurring form of a THBS2, 
ACE, or FGB protein. 

Assays for determining whether a THBS2, ACE, or FGB protein or variant thereof, 
has one or more biological activities are well known in the art. 

Other preferred proteins of the invention are those encoded by the nucleic acids set 

25 forth in the section pertaining to nucleic acids of the invention. In particular, the invention 
provides fusion proteins, e,g„ THBS2, ACE, or FGB-immimoglobulin fusion proteins. Such 
fusion proteins can provide, e.g., enhanced stability and solubility of THBS2, ACE, or FGB 
proteins and may thus be useful in therapy. Fusion proteins can also be used to produce an 
immunogenic fragment of a THBS2, ACE, or FGB protein. For example, the VP6 capsid 
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protein of rotaviras can be used as an immunologic carrier protein for portions of the 
THBS2, ACE, or FGB polypeptide, either in the monomeric form or in the form of a viral 
particle. The nucleic acid sequences corresponding to the portion of a subject THBS2, ACE, 
or FGB protein to which antibodies are to be raised can be incorporated into a fusion gene 
5 construct which includes coding sequences for a late vaccinia virus structural protein to 
produce a set of recombinant viruses expressing fusion proteins comprising THBS2, ACE, 
or FGB epitopes as part of the virion. It has been demonstrated with the use of immunogenic 
fusion proteins utilizing the Hepatitis B surface antigen fusion proteins that recombinant 
Hepatitis B virions can be utilized in this role as well. Similarly, chimeric constructs coding 

1 0 for fusion proteins containing a portion of a THBS2, ACE, or FGB protein and the poliovims 
capsid protein can be created to enhance immunogenicity of the set of polypeptide antigens 
(see, for example, EP Publication No: 0259149; and Evans et al (1989) Nature 339:385; 
Huang et al (1988) 1 ViroL 62:3855; and Schlienger et al (1992) J. Virol 66:2). 

The Multiple antigen peptide system for peptide-based immunization can also be 

1 5 utilized to generate an immunogen, wherein a desired portion of a THBS2, ACE, or FGB 
polypeptide is obtained directly from organo-chemical synthesis of the peptide onto an 
oligomeric branching lysine core (see, for example, Posnett et al (1988) JBC 263:1719 and 
Nardelli et al (1992) J. Immunol 148:914). Antigenic determinants of THBS2, ACE, or 
FGB proteins can also be expressed and presented by bacterial cells. 

20 Fusion proteins can also facilitate the expression of proteins including the THBS2, 

ACE, or FGB polypeptides of the present invention. For example, THBS2, ACE, or FGB 
polypeptides can be generated as glutathione-S-transferase (GST-fusion) proteins. Such 
GST-fusion proteins can be easily purified, as for example by the use of glutathione- 
derivatized matrices (see, for example, Current Protocols in Molecular Biology, eds. Ausubel 

25 et al (N.Y.: John Wiley & Sons, 1991)) and used subsequently to yield purified THBS2, 
ACE, or FGB polypeptides. 

The present invention further pertains to methods of producing the subject THBS2, 
ACE, or FGB polypeptides. For example, a host cell transfected with a nucleic acid vector 
directing expression of a nucleotide sequence encoding the subject polypeptides can be 
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cultured under appropriate conditions to allow expression of the peptide to occur. Suitable 
media for cell culture are well known in the art. The recombinant THBS2, ACE, or FGB 
polypeptide can be isolated from cell culture medium, host cells, or both using techniques 
known in the art for purifying proteins including ion-exchange chromatography, gel filtration 
5 chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification with 

antibodies specific for such peptide. In a preferred embodiment, the recombinant THBS2, 
ACE, or FGB polypeptide is a fusion protein containing a domain which facilitates its 
purification, such as GST fusion protein. 

Moreover, it will be generally appreciated that, under certain circvimstances, it may be 

10 advantageous to provide homologs of one of the subject THBS2, ACE, or FGB polypeptides 
which function in a Umited capacity as one of either a THBS2, ACE, or FGB agonist 
(mimetic) or a THBS2, ACE, or FGB antagonist, in order to promote or inhibit only a subset 
of the biological activities of the naturally-occurring form of the protein. Thus, specific 
biological effects can be elicited by treatment with a homolog of limited function, and with 

15 fewer side effects relative to treatment with agonists or antagonists which are directed to all 
of the biological activities of naturally occurring forms of THBS2, ACE, or FGB proteins. 

Homologs of each of the subject THBS2, ACE, or FGB proteins can be generated by 
mutagenesis, such as by discrete point mutation(s), and/or by truncation. For instance, 
mutation can give rise to homologs which retain substantially the same, or merely a subset, 

20 of the biological activity of the THBS2, ACE, or FGB polypeptide from which it was 

derived. Altematively, antagonistic forms of the protein can be generated which are able to 
inhibit the function of the naturally occurring form of the protein, such as by competitively 
binding to a THBS2, ACE, or FGB receptor. 

The recombinant THBS2, ACE, or FGB polypeptides of the present invention also 

25 include homologs of THBS2, ACE, or FGB polypeptides which differ from the THBS2, 
ACE, or FGB protein having SEQ ID N0:2, SEQ ID N0:4, or SEQ ID N0:6, respectively, 
such as versions of the protein which are resistant to proteolytic cleavage, as for example, 
due to mutations which alter ubiquitination or other enzymatic targeting associated with the 
protein. 
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THBS2, ACE, or FGB polypeptides may also be chemically modified to create 
THBS2, ACE, or FGB derivatives by forming covalent or aggregate conjugates with other 
chemical moieties, such as glycosyl groups, lipids, phosphate, acetyl groups and the like. 
Covalent derivatives of THBS2, ACE, or FGB proteins can be prepared by linking the 

5 chemical moieties to functional groups on amino acid side-chains of the protein or at the N- 
terminus or at the C-terminus of the polypeptide. 

Modification of the structure of the subject THBS2, ACE, or FGB polypeptides can 
be for such purposes as enhancing therapeutic or prophylactic efficacy, stability {e.g., ex vivo 
shelf life and resistance to proteolytic degradation), or post-translational modifications (e.g., 

10 to alter phosphorylation pattem of protein). Such modified peptides, when designed to retain 
at least one activity of the naturally-occurring form of the protein, or to produce specific 
antagonists thereof, are considered fimctional equivalents of the THBS2, ACE, or FGB 
polypeptides described in more detail herein. Such modified peptides can be produced, for 
instance, by amino acid substitution, deletion, or addition. The substitutional variant may be 

15 a substituted conserved amino acid or a substituted non-conserved amino acid. 

For example, it is reasonable to expect that an isolated replacement of a leucine with 
an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar 
replacement of an amino acid with a structurally related amino acid (i.e., isosteric and/or 
isoelectric mutations) will not have a major effect on the biological activity of the resulting 

20 molecule. Conservative replacements are those that take place within a family of amino acids 
that are related in their side chains. Genetically encoded amino acids can be divided into four 
families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) nonpolar 
= alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) 
xmcharged polar = glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine. In 

25 similar fashion, the amino acid repertoire can be grouped as (1) acidic = aspartate, glutamate; 
(2) basic = lysine, arginine histidine, (3) aliphatic = glycine, alanine, valine, leucine, 
isoleucine, serine, threonine, with serine and threonine optionally be grouped separately as 
aliphatic-hydroxyl; (4) aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = 
asparagine, glutamine; and (6) sulfur -containing = cysteine and methionine, (see, for 
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example. Biochemistry, 2°^ ed., Ed. by L.Stryer,WH Freeman and Co.: 1981). Whether a 
change in the amino acid sequence of a peptide results in a functional THBS2, ACE, or FOB 
homolog (e.g., functional in the sense that the resulting polypeptide mimics or antagonizes 
the wild-type form) can be readily determined by assessing the ability of the variant peptide 
5 to produce a response in cells in a fashion similar to the wild-type protein, or competitively 
inhibit such a response. Polypeptides in which more than one replacement has taken place 
can readily be tested in the same manner. 

Methods 

10 The invention further provides predictive medicine methods, which are based, at 

least in part, on the discovery of THBS2, ACE, or FOB polymorphic regions which are 
associated with specific physiological states and/or diseases or disorders, e.g., vascular 
diseases or disorders such as CAD and ML These methods can be used alone, or in 
combination with other predictive medicine methods, including the identification and 

1 5 analysis of known risk factors associated with vascular disease, e.g., phenotypic factors such 
as, for example, obesity, diabetes, and/or family history. 

For example, information obtained using the diagnostic assays described herein 
(singly or in combination with information of another genetic defect which contributes to the 
same disease, e.g., a vascular disease or disorder) is useful for diagnosing or confirming that 

20 a subject has an allele of a polymorphic region which is associated with a particular disease 
or disorder, e.g., a vascular disease or disorder. Moreover, the information obtained using 
the diagnostic assays described herein, singly or in combination with information of another 
genetic defect which contributes to the same disease, e.g., a vascular disease or disorder, can 
be used to predict whether or not a subject will benefit from further diagnostic evaluation for 

25 a vascular disease or disorder. Such further diagnostic evaluation includes, but is not Umited 
to, cardiovascular imaging, such as angiography, cardiac ultrasound, coronary angiogram, 
magnetic resonance imagery, nuclear imaging, CT scan, myocardial perfusion imagery, or 
electrocardiogram, genetic analysis, e.g., identification of additional polymorphisms, e.g., 
which contribute to the same disease, familial health history analysis, lifestyle analysis, or 
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exercise stress tests, either alone or in combination. Furthermore, the diagnostic information 
obtained using the diagnostic assays described herein (singly or in combination with 
information of another genetic defect which contributes to the same disease, e.g., a vascular 
disease or disorder), may be used to identify which subject will benefit from a particular 

5 clinical course of therapy useful for preventing, treating, ameliorating, or prolonging onset of 
the particular vascular disease or disorder in the particular subject. Clinical courses of 
therapy include, but are not limited to, administration of medication, non-surgical 
intervention, surgical intervention or procedures, and use of surgical and non-surgical 
medical devices used in the treatment of vascular disease, such as, for example, stents or 

10 defibrillators, 

Altematively, the information, smgly, or in combination with information of another 
genetic defect which contributes to the same disease, e.g., a vascular disease or disorder, can 
be used prognostically for predicting whether a non-symptomatic subject is likely to develop 
a disease or condition which is associated with one or more specific alleles of THBS2, ACE, 

15 or FGB polymorphic regions in a subject. Based on the prognostic information, a health care 
provider can recommend a particular further diagnostic evaluation which will benefit the 
subject, or a particular clinical course of therapy, as described above. 

In addition, knowledge of the identity of a particular THBS2, ACE, or FGB allele in 
a subject (the THBS2, ACE, or FGB genetic profile), singly, or in combination, allows 

20 customization of further diagnostic evaluation and/or a clinical covirse of therapy for a 
particular disease. For example, a subject's THBS2, ACE, or FGB genetic profile or the 
genetic profile of a disease or disorder associated with a specific allele of a THBS2, ACE, or 
FGB polymorphic region, e.g., a vascular disease or disorder, can enable a health care 
provider: 1) to more efficiently and cost-effectively identify means for further diagnostic 

25 evaluation, including, but not limited to, further genetic analysis, familial health history 

analysis, or use of vascular imaging devices; 2) to more effectively prescribe a drug that will 
address the molecular basis of the disease or condition; 3) to more efficiently and cost- 
effectively identify an appropriate clinical coxirse of therapy, including, but not limited to, 
lifestyle changes, medications, surgical or non-surgical devices, surgical or non-surgical 
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intervention, or any combination thereof; and 4) to better determine the appropriate dosage of 
a particular drug or duration of a particular course of clinical therapy. For example, the 
expression level of THBS2, ACE, or FOB proteins, alone or in conjunction with the 
expression level of other genes, known to contribute to the same disease, can be measured in 
many subjects at various stages of the disease to generate a transcriptional or expression 
profile of the disease. Expression patterns of individual subjects can then be compared to the 
expression profile of the disease to determine the appropriate drug, dose to administer to the 
subject, or course of clinical therapy. 

The ability to target populations expected to show the highest clinical benefit, based 
on the THBS2, ACE, or FGB or disease genetic profile, can enable: 1) the repositioning of 
marketed drugs, surgical devices for use in treating, preventing, or ameliorating vascular 
diseases or disorders, or diagnostics, such as vascular imaging devices, with disappointing 
market resuUs; 2) the rescue of drug candidates whose clinical development has been 
discontinued as a result of safety or efficacy limitations, which are subject subgroup- 
specific; 3) an accelerated and less costly development for drug candidates and more 
optimal drug labeling (e.g., since the use of THBS2, ACE, or FGB as a marker is useful for 
optimizing effective dose); and 4) an accelerated, less costly, and more effective selection of 
a particular course of clinical therapy suited to a particular subject. 

These and other methods are described in further detail in the following sections. 

A. Prognostic and Diagnostic Assays 

The present methods provide means for determining if a subject is or is not at risk of 
developing a disease, condition or disorder that is associated a specific THBS2, ACE, or 
FGB allele, e.g., a vascular disease or a disease or disorder resulting therefrom. 

The present invention provides methods for determining the molecular structure of a 
THBS2, ACE, or FGB gene, such as a human THBS2, ACE, or FGB gene, or a portion 
thereof. In one embodiment, determining the molecular structure of at least a portion of a 
THBS2, ACE, or FGB gene comprises determining the identity of an allelic variant of at 
least one polymorphic region of a THBS2, ACE, or FGB gene (determining the presence or 
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absence of one or more of the allelic variants, or their complements, of SEQ ID NOs. :7, 8, 9, 
10, and/or 1 1). A polymorphic region of a THBS2, ACE, or FGB gene can be located in an 
exon, an intron, at an intron/exon border, or in the 5' upstream regulatory element of the 
THBS2, ACE, or FGB gene. 
5 The invention provides methods for determining whether a subject is or is not at risk 

of developing a disease or disorder associated with a specific allelic variant of a polymorphic 
region of a THBS2, ACE, or FGB gene. Such diseases can be associated with aberrant 
THBS2, ACE, or FGB activity, e.g., a vascular disease or disorder such as CAD or ML 

Analysis of one or more THBS2, ACE, or FGB polymorphic regions in a subject can 
|,..^ 1 0 be useful for predicting whether a subject is or is not likely to develop a vascular disease or 

I;--; disorder, e.g., atherosclerosis, CAD, MI, ischemia, stroke, peripheral vascular diseases, 

n venous thromboembolism and pulmonary embolism. 

%J In preferred embodiments, the methods of the invention can be characterized as 

[^^ comprising detecting, in a sample of cells from the subject, the presence or absence of a 

. 15 specific allelic variant of one or more polymorphic regions of a THBS2, ACE, or FGB gene, 

i U Preferably, the presence of the variant allele of the THBS2, ACE, and/or FGB gene 

:C described herein are detected. The allelic differences can be: (i) a difference in the identity 

i of at least one nucleotide or (ii) a difference in the number of nucleotides, which difference 
can be a single nucleotide or several nucleotides. The invention also provides methods for 
20 detecting differences in THBS2, ACE, or FGB genes such as chromosomal rearrangements, 
e.g., chromosomal dislocation. The invention can also be used in prenatal diagnostics. 

A preferred detection method is allele specific hybridization using probes 
overlapping the polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around 
the polymorphic region. In a preferred embodiment of the invention, several probes capable 
25 of hybridizing specifically to allelic variants are attached to a solid phase support, e.g., a 

"chip". Oligonucleotides can be bound to a sohd support by a variety of processes, including 
lithography. For example a chip can hold up to 250,000 oligonucleotides (GeneChip, 
Affymetrix^^. Mutation detection analysis using these chips comprising oligonucleotides, 
also termed "DNA probe arrays" is described e.g., in Cronin et al (1996) Human Mutation 
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7:244. In one embodiment, a chip comprises all the allelic variants of at least one 
polymorphic region of a gene. The solid phase support is then contacted with a test nucleic 
acid and hybridization to the specific probes is detected. Accordingly, the identity of 
numerous allelic variants of one or more genes can be identified in a simple hybridization 
5 experiment. For example, the identity of the allelic variant of the nucleotide polymorphism 
in the 5' upstream regulatory element can be determined in a single hybridization experiment. 

In other detection methods, it is necessary to first amplify at least a portion of a 
THBS2, ACE, or FGB gene prior to identifying the allelic variant. Amplification can be 
performed, e,g,, by PGR and/or LCR (see Wu and Wallace (1989) Genomics 4:560), 

1 0 according to methods known in the art. In one embodiment, genomic DNA of a cell is 

exposed to two PGR primers and amplification for a number of cycles sufficient to produce 
the required amount of amplified DNA. In preferred embodiments, the primers are located 
between 150 and 350 base pairs apart. 

Alternative amplification methods include: self sustained sequence replication 

15 (Guatelli, J.C. et al, (1990) Proc, Natl Acad. Set USA 87:1874-1878), transcriptional 

amplification system (Kwoh, D.Y. et al, (1989) Proc. Natl Acad. Scl USA 86:1173-1 177), 
Q-Beta Replicase (Lizardi, P.M. et al, (1988) Bio/Technology 6:1 197), and self-sustained 
sequence replication {Guatelli et al, (1989) Proc, Nat Acad. Scl 87:1874), and nucleic acid 
based sequence amplification (NABSA), or any other nucleic acid amplification method, 

20 followed by the detection of the amplified molecules using techniques well known to those 
of skill in the art. These detection schemes are especially useful for the detection of nucleic 
acid molecules if such molecules are present in very low numbers. 

In one embodiment, any of a variety of sequencing reactions known in the art can be 
used to directly sequence at least a portion of a THBS2, ACE, or FGB gene and detect alleUc 

25 variants, e.g., mutations, by comparing the sequence of the sample sequence with the 

corresponding reference (control) sequence. Exemplary sequencing reactions include those 
based on techniques developed by Maxam and Gilbert {Proc. Natl Acad Sci USA (1977) 
74:560) or Sanger (Sanger et al (1977) Proc, Nat Acad. Sci 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures may be utilized when 
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performing the subject assays {Biotechniques (1995) 19:448), including sequencing by mass 
spectrometry (see, for example, U.S. Patent Number 5,547,835 and international patent 
application Publication Number WO 94/16101, entitled DNA Sequencing by Mass 
Spectrometry by H. Koster; U.S. Patent Number 5,547,835 and intemational patent 
5 application Publication Number WO 94/21 822 entitled "DNA Sequencing by Mass 
Spectrometry Via Exonuclease Degradation" by H. Koster), and U.S Patent Number 
5,605,798 and Intemational Patent Application No. PCT/US96/03651 entitled DAM 
Diagnostics Based on Mass Spectrometry by H. Koster;. Cohen et al. (1996) A(h 
Chromatogr 36:127-162; and Griffin et al. (1993) Appl Biochem Biotechnol 38:147-159). It 
10 will be evident to one skilled in the art that, for certain embodiments, the occurrence of only 
one, two or three of the nucleic acid bases need be determined in the sequencing reaction. 
For instance, A-track or the like, e.g., where only one nucleotide is detected, can be carried 
out. 

Yet other sequencing methods are disclosed, e.g., in U.S. Patent Number 5,580,732 
1 5 entitled ''Method of DNA sequencing employing a mixed DNA-polymer chain probe" and 
U.S. Patent Number 5,571,676 entitled "Method for mismatch-directed in vitro DNA 
sequencing." 

In some cases, the presence of a specific allele of a THBS2, ACE, or FGB gene in 
DNA firom a subject can be shown by restriction enzyme analysis. For example, a specific 

20 nucleotide polymorphism can result in a nucleotide sequence comprising a restriction site 
which is absent from the nucleotide sequence of another allelic variant. 

In a further embodiment, protection from cleavage agents (such as a nuclease, 
hydroxylamine or osmium tetroxide and with piperidine) can be used to detect mismatched 
bases in RNA/RNA DNA/DNA, or RNA/DNA heteroduplexes (Myers, et al (1985) Science 

25 230: 1242). In general, the technique of "mismatch cleavage" starts by providing 

heteroduplexes formed by hybridizing a control nucleic acid, which is optionally labeled, 
e.g., RNA or DNA, comprising a nucleotide sequence of a THBS2, ACE, or FGB allelic 
variant with a sample nucleic acid, e,g.y RNA or DNA, obtained firom a tissue sample. The 
double-stranded duplexes are treated with an agent which cleaves single-stranded regions of 
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the duplex such as duplexes formed based on basepair mismatches between the control and 
sample strands. For instance, RNA/DNA duplexes can be treated with RNase and 
DNA/DNA hybrids treated with SI nuclease to enzymatically digest the mismatched regions. 
In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
5 hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 

regions. After digestion of the mismatched regions, the resulting material is then separated 
by size on denaturing polyacrylamide gels to determine whether the control and sample 
nucleic acids have an identical nucleotide sequence or in which nucleotides they are 
different. See, for example. Cotton et al (1988) Proc, Natl Acad Sci USA 85:4397; Saleeba 
10 etal (1992) Methods EnzymoL 217:286-295. In a preferred embodiment, the control or 
.'5' sample nucleic acid is labeled for detection. 

f ^ In another embodiment, an allelic variant can be identified by denaturing high- 

%j performance liquid chromatography (DHPLC) (Oefner and Underbill, (1995) Am, J. Human 

? Gem 57:Suppl. A266). DHPLC uses reverse-phase ion-pairing chromatography to detect the 

15 heteroduplexes that are generated during amplification of PGR fragments from individuals 
i U who are heterozygous at a particular nucleotide locus within that fragment (Oefner and 

I p Underbill (1995) Am. J. Human Gen, 57:Suppl. A266). In general, PCR products are 

produced using PCR primers flanking the DNA of interest. DHPLC analysis is carried out 
and the resulting chromatograms are analyzed to identify base pair alterations or deletions 
20 based on specific chromatographic profiles (see O'Donovan et al (1998) Genomics Sl'AA- 
49). 

In other embodiments, alterations in electrophoretic mobility is used to identify the 
type of THBS2, ACE, or FOB allelic variant. For example, single strand conformation 
polymorphism (SSCP) may be used to detect differences in electrophoretic mobility between 
25 mutant and wild type nucleic acids (Orita et al (1989) Proc Natl Acad, Sci USA 86:2766, 
see also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet Anal TechAppl 
9:73-79). Single-stranded DNA fragments of sample and control nucleic acids are denatured 
and allowed to renature. The secondary structure of single-stranded nucleic acids varies 
according to sequence, the resulting alteration in electrophoretic mobility enables the 
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detection of even a single base change. The DNA fragments may be labeled or detected with 
labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather than 
DNA), in which the secondary structure is more sensitive to a change in sequence. In 
another preferred embodiment, the subject method utilizes heteroduplex analysis to separate 

5 double stranded heteroduplex molecules on the basis of changes in electrophoretic mobility 
(Keen et al (1991) Trends Genet 7:5). 

In yet another embodiment, the identity of an allelic variant of a polymorphic region 
is obtained by analyzing the movement of a nucleic acid comprising the polymorphic region 
in polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 

10 gradient gel electrophoresis (DGGE) (Myers et al (1985) Nature 313:495). When DGGE is 
used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for example by adding a GC clamp of approximately 40 bp of high-melting GC- 
rich DNA by PGR. In a further embodiment, a temperature gradient is used in place of a 
denaturing agent gradient to identify differences in the mobility of control and sample DNA 

15 (Rosenbaum and Reissner (1987) Biophys Chem 265:1275). 

Examples of techniques for detecting differences of at least one nucleotide between 
2 nucleic acids include, but are not limited to, selective oligonucleotide hybridization, 
selective amplification, or selective primer extension. For example, oligonucleotide probes 
may be prepared in which the known polymorphic nucleotide is placed centrally (allele- 

20 specific probes) and then hybridized to target DNA under conditions which permit 

hybridization only if a perfect match is found (Saiki et al (1986) Nature 324:163); Saiki et al 
(1989) Proc. Natl Acad. Sci USA 86:6230; and Wallace et al (1979) Nucl Acids Res. 
6:3543). Such allele specific oligonucleotide hybridization techniques may be used for the 
simultaneous detection of several nucleotide changes in different polylmorphic regions of 

25 THBS2, ACE, or FGB. For example, oligonucleotides having nucleotide sequences of 
specific allelic variants are attached to a hybridizing membrane and this membrane is then 
hybridized with labeled sample nucleic acid. Analysis of the hybridization signal will then 
reveal the identity of the nucleotides of the sample nucleic acid. 
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Alternatively, allele specific amplification technology which depends on selective 
PGR amplification may be used in conjunction with the instant invention. Oligonucleotides 
used as primers for specific amplification may carry the allelic variant of interest in the 
center of the molecule (so that amplification depends on differential hybridization) (Gibbs et 
5 al (1 989) Nucleic Acids Res, 1 7:2437-2448) or at the extreme 3' end of one primer where, 
under appropriate conditions, mismatch can prevent, or reduce polymerase extension 
(Prossner (1993) Tibtech 11:238; Newton et al (1989) Nucl Acids Res. 17:2503). This 
technique is also termed "PROBE" for Probe Oligo Base Extension. In addition it may be 
desirable to introduce a novel restriction site in the region of the mutation to create cleavage- 

1 0 based detection (Gasparini et al (1 992) Mol Cell Probes 6:1). 

In another embodiment, identification of the allelic variant is carried out using an 
oHgonucleotide ligation assay (OLA), as described, e.g., in U.S. Patent Number 4,998,617 
and in Landegren, U. et al, (1988) Science 241:1077-1080. The OLA protocol uses two 
oligonucleotides which are designed to be capable of hybridizing to abutting sequences of a 

15 single strand of a target. One of the oUgonucleotides is linked to a separation marker, e.g., 
biotinylated, and the other is detectably labeled. If the precise complementary sequence is 
found in a target molecule, the oligonucleotides will hybridize such that their termini abut, 
and create a ligation substrate. Ligation then permits the labeled oligonucleotide to be 
recovered using avidin, or another biotin ligand. Nickerson, D. A. et al have described a 

20 nucleic acid detection assay that combines attributes of PGR and OLA (Nickerson, D. A. et 
al, (1990) Proc. Natl Acad. Sci (U.S.A.) 87:8923-8927. In this method, PGR is used to 
achieve the exponential amplification of target DNA, which is then detected using OLA, 

Several techniques based on this OLA method have been developed and can be used 
to detect specific allelic variants of a polymorphic region of a THBS2, AGE, or FGB gene. 

25 For example, U.S. Patent Nxmiber 5,593,826 discloses an OLA using an oligonucleotide 

having 3'-amino group and a 5'-phosphorylated oligonucleotide to form a conjugate having a 
phosphoramidate linkage. In another variation of OLA described in Tobe et al ((1996) 
Nucleic Acids Res 24: 3728), OLA combined with PGR permits typing of two alleles in a 
single microtiter well By marking each of the allele-specific primers with a unique hapten. 
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le, digoxigenin and fluorescein, each OLA reaction can be detected by using hapten specific 
antibodies that are labeled with different enzyme reporters, alkaline phosphatase or 
horseradish peroxidase. This system permits the detection of the two alleles using a high 
throughput format that leads to the production of two different colors. 

The invention further provides methods for detecting single nucleotide 
polymorphisms in a THBS2, ACE, or FGB gene. Because single nucleotide polymorphisms 
constitute sites of variation flanked by regions of invariant sequence, their analysis requires 
no more than the determination of the identity of the single nucleotide present at the site of 
variation and it is unnecessary to determine a complete gene sequence for each subject. 
Several methods have been developed to facilitate the analysis of such single nucleotide 
polymorphisms. 

In one embodiment, the single base polymorphism can be detected by using a 
specialized exonuclease-resistant nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S. Patent 
Number 4,656,127). According to the method, a primer complementary to the allelic 
sequence immediately 3' to the polymorphic site is permitted to hybridize to a target 
molecule obtained from a particular animal or human. If the polymorphic site on the target 
molecule contains a nucleotide that is complementary to the particular exonuclease-resistant 
nucleotide derivative present, then that derivative will be incorporated onto the end of the 
hybridized primer. Such incorporation renders the primer resistant to exonuclease, and 
thereby permits its detection. Since the identity of the exonuclease-resistant derivative of the 
sample is known, a finding that the primer has become resistant to exonucleases reveals that 
the nucleotide present in the polymorphic site of the target molecule was complementary to 
that of the nucleotide derivative used in the reaction. This method has the advantage that it 
does not require the determination of large amounts of extraneous sequence data. 

In another embodiment of the invention, a solution-based method is used for 
determining the identity of the nucleotide of a polymorphic site. Cohen, D. et al (French 
Patent 2,650,840; PCT Apphi. No. WO91/02087). As in the Mundy method of U.S. Patent 
Number 4,656,127, a primer is employed that is complementary to allelic sequences 
immediately 3' to a polymorphic site. The method determines the identity of the nucleotide 
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of that site using labeled dideoxynucleotide derivatives, which, if complementary to the 
nucleotide of the polymorphic site will become incorporated onto the terminus of the primer. 

An altemative method, known as Genetic Bit Analysis or GBA™ is described by 
Goelet, P. et al (PCT Appln. No. 92/15712). The method of Goelet, P. et al uses mixtures of 
5 labeled terminators and a primer that is complementary to the sequence 3' to a polymorphic 
site. The labeled terminator that is incorporated is thus determined by, and complementary 
to, the nucleotide present in the polymorphic site of the target molecule being evaluated. In 
contrast to the method of Cohen et al (French Patent 2,650,840; PCT Appln. No. 
W09 1/02087) the method of Goelet, P. et al is preferably a heterogeneous phase assay, in 
10 which the primer or the target molecule is immobilized to a solid phase. 
'p^ Recently, several primer-guided nucleotide incorporation procedures for assaying 

;| ^ polymorphic sites in DNA have been described (Komher, J. S. et al, (1989) Nucl Acids. 

' J Res, Sokolov, B. P., (1990) Nucl Acids Res. 18:3671 ; Syvanen, A. -C, et al, 

jE (1990) Genomics 8:684-692; Kuppuswamy, M. N. et al, (1991) Proc. Natl Acad. Scl 

15 (U.S.A.) 88:1 143-1 147; Prezant, T. R. et al, (1992) Hum. Mutat. 1 :159-164; Ugozzoli, L. et 
^ y al, (1992) GATA 9:107-112; Nyren, P. (1993) etal,Anal Biochem. 208:171-175). These 

' P methods differ from GBA™ in that they all rely on the incorporation of labeled 

r ~"i 

i^I deoxynucleotides to discriminate between bases at a polymorphic site. In such a format, 

since the signal is proportional to the number of deoxynucleotides incorporated, 
20 polymorphisms that occur in runs of the same nucleotide can result in signals that are 
proportional to the length of the run (Syvanen, A.C., et al, (1993) Amer, J. Hum. Genet. 
52:46-59). 

For determining the identity of the allelic variant of a polymorphic region located in 
the coding region of a THBS2, ACE, or FGB gene, yet other methods than those described 
25 above can be used. For example, identification of an allelic variant which encodes a mutated 
THBS2, ACE, or FGB protein can be performed by using an antibody specifically 
recognizing the mutant protein in, e.g., immunohistochemistry or immunoprecipitation. 
Antibodies to wild-type THBS2, ACE, or FGB or mutated forms of THBS2, ACE, or FGB 
proteins can be prepared according to methods known in the art. 
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Alternatively, one can also measure an activity of a THBS2, ACE, or FGB protein, 
such as binding to a THBS2, ACE, or FGB ligand. Binding assays are known in the art and 
involve, e.g., obtaining cells from a subject, and performing binding experiments with a 
labeled ligand, to determine whether binding to the mutated form of the protein differs from 
5 binding to the wild-type of the protein. 

Antibodies directed against reference or mutant THBS2, ACE, or FGB polypeptides 
or allelic variant thereof, which are discussed above, may also be used in disease diagnostics 
and prognostics. Such diagnostic methods, may be used to detect abnormalities in the level 
of THBS2, ACE, or FGB polypeptide expression, or abnormalities in the structure and/or 

10 tissue, cellular, or subcellular location of a THBS2, ACE, or FGB polypeptide. Structural 
differences may include, for example, differences in the size, electronegativity, or 
antigenicity of the mutant THBS2, ACE, or FGB polypeptide relative to the normal THBS2, 
ACE, or FGB polypeptide. Protein from the tissue or cell type to be analyzed may easily be 
detected or isolated using techniques which are well known to one of skill in the art, 

1 5 including but not limited to Western blot analysis. For a detailed explanation of methods for 
carrying out Westem blot analysis, see Sambrook et al, 1989, supra, at Chapter 18. The 
protein detection and isolation methods employed herein may also be such as those described 
in Harlow and Lane, for example, (Harlow, E. and Lane, D., 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York), 

20 which is incorporated herein by reference in its entirety. 

This can be accomplished, for example, by immunofluorescence techniques 
employing a fluorescently labeled antibody (see below) coupled with light microscopic, flow 
cytometric, or fluorimetric detection. The antibodies (or fragments thereof) usefiil in the 
present invention may, additionally, be employed histologically, as in immunofluorescence 

25 or immunoelectron microscopy, for in situ detection of THBS2, ACE, or FGB polypeptides. 
In situ detection may be accomplished by removing a histological specimen from a subject, 
and applying thereto a labeled antibody of the present invention. The antibody (or fragment) 
is preferably applied by overlaying the labeled antibody (or fragment) onto a biological 
sample. Through the use of such a procedure, it is possible to determine not only the 
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presence of the THBS2, ACE, or FGB polypeptide, but also its distribution in the examined 
tissue. Using the present invention, one of ordinary skill will readily perceive that any of a 
wide variety of histological methods (such as staining procedures) can be modified in order 
to achieve such in situ detection. 

5 Often a solid phase support or carrier is used as a support capable of binding an 

antigen or an antibody. Well-known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble to 
some extent or insoluble for the purposes of the present invention. The support material may 

10 have virtually any possible structural configuration so long as the coupled molecule is 
capable of binding to an antigen or antibody. Thus, the support configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the external 
surface of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. 
Preferred supports include polystyrene beads. Those skilled in the art will know many other 

15 suitable carriers for binding antibody or antigen, or will be able to ascertain the same by use 
of routine experimentation. 

One means for labeling an anti-THBS2, ACE, or FGB polypeptide specific antibody 
is via linkage to an enzyme and use in an enzyme immunoassay (EIA) (VoUer, "The Enzyme 
Linked Immunosorbent Assay (ELISA)", Diagnostic Horizons 2:1-7, 1978, Microbiological 

20 Associates Quarterly PubUcation, Walkersville, MD; VoUer, et al., (1978) J. Clin. Pathol. 
31:507-520; Butler, (19SI) Metk EnzymoL 73:482-523; Maggio, (ed.) Enzyme 
Immunoassay, CRC Press, Boca Raton, FL, 1980; Ishikawa, et al., (eds.) Enzyme 
Immunoassay, Kgaku Shoin, Tokyo, 1981). The enzyme which is bound to the antibody will 
react with an appropriate substrate, preferably a chromogenic substrate, in such a manner as 

25 to produce a chemical moiety which can be detected, for example, by spectrophotometric, 

fluorimetric or by visual means. Enzymes which can be used to detectably label the antibody 
include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5- 
steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate, dehydrogenase, 
triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, 
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glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate 
dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be accomplished 
by colorimetric methods which employ a chromogenic substrate for the enzyme. Detection 
may also be accomplished by visual comparison of the extent of enzymatic reaction of a 

5 substrate in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other immunoassays. 
For example, by radioactively labeling the antibodies or antibody fragments, it is possible to 
detect fingerprint gene wild type or mutant peptides through the use of a radioimmunoassay 
(RIA) (see, for example, Wemtraub, B,, Principles of Radioimmunoassays, Seventh Training 

10 Course on Radioligand Assay Techniques, The Endocrine Society, March, 1986, which is 
incorporated by reference herein). The radioactive isotope can be detected by such means as 
the use of a gamma counter or a scintillation counter or by autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the 
fluorescently labeled antibody is exposed to light of the proper wave length, its presence can 

15 then be detected due to fluorescence. Among the most commonly used fluorescent labeling 
compoimds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, 
allophycocyanin, o-phthaldehyde and fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting metals such 

as ^^^Eu, or others of the lanthanide series. These metals can be attached to the antibody 
20 using such metal chelating groups as diethylenetriaminepentacetic acid (DTP A) or 

ethylenediaminetetraacetic acid (EDTA). 

The antibody also can be detectably labeled by coupling it to a chemiluminescent 

compound. The presence of the chemiluminescent-tagged ^tibody is then determined by 

detecting the presence of luminescence that arises during the course of a chemical reaction. 
25 Examples of particularly useful chemiluminescent labeling compounds are luminol, 

isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label the antibody of the 

present invention. Bioluminescence is a type of chemiluminescence found in biological 

systems in, which a catalytic protein increases the efficiency of the chemiluminescent 
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reaction. The presence of a bioluminescent protein is determined by detecting the presence 
of luminescence. Important bioluminescent compounds for purposes of labeling are 
luciferin, luciferase and aequorin. 

If a polymorphic region is located in an exon, either in a coding or non-coding 
5 portion of the gene, the identity of the allelic variant can be determined by determining the 
molecular structure of the mRNA, pre-mRNA, or cDNA. The molecular structure can be 
determined using any of the above described methods for determining the molecular 
structure of the genomic DNA, e.g., see Example 1 . 

The methods described herein may be performed, for example, by utilizing pre- 
;,,iL 10 packaged diagnostic kits, such as those described above, comprising at least one probe or 

primer nucleic acid described herein, which may be conveniently used, e.g., to determine 
1;^" whether a subject is or is not at risk of developing a disease associated with a specific 

'4 THBS2, ACE, or FGB allelic variant. 

5 Sample nucleic acid to be analyzed by any of the above-described diagnostic and 

^ 15 prognostic methods can be obtained from any cell type or tissue of a subject. For example, a 

i y subject's bodily fluid {e.g. blood) can be obtained by known techniques (e.g. venipuncture). 

Ip Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin). Fetal 

i nucleic acid samples can be obtained fi:om matemal blood as described in International 

Patent Application No. WO91/07660 to Bianchi. Alternatively, amniocytes or chorionic viUi 
20 may be obtained for performing prenatal testing. 

Diagnostic procedures may also be performed in situ directly upon tissue sections 
(fixed and/or frozen) of subject tissue obtained from biopsies or resections, such that no 
nucleic acid purification is necessary. Nucleic acid reagents may be used as probes and/or 
primers for such in situ procedures (see, for example, Nuovo, G.J., 1992, PGR in situ 
25 hybridisation: protocols and applications. Raven Press, NY). 

In addition to methods which focus primarily on the detection of one nucleic acid 
sequence, profiles may also be assessed in such detection schemes. Fingerprint profiles may 
be generated, for example, by utilizing a differential display procedure, Northem analysis 
and/or RT-PCR. 
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B. Pharmacogenomics 

Knowledge of the identity of the allele of one or more THBS2, ACE, and/or FGB 
gene polymorphic regions in a subject (the THBS2, ACE, and/or FGB genetic profile), alone 
5 or in conjunction with information of other genetic defects associated with the same disease 
(the genetic profile of the particular disease) also allows selection and customization of the 
therapy, e.g., a particular clinical course of therapy and/or further diagnostic evaluation for a 
particular disease to the subject's genetic profile. For example, subjects having a specific 
allele of a THBS2, ACE, or FGB gene, singly or in combination, may or may not exhibit 
^ 10 symptoms of a particular disease or be predisposed to developing symptoms of a particular 

zj disease. Further, if those subjects are symptomatic, they may or may not respond to a certain 

drug, e.g., a specific therapeutic used in the treatment or prevention of a vascular disease or 
4 disorder, e.g., CAD or MI, such as, for example, beta blocker drugs, calcium channel blocker 

y 

2 drugs, and/or nitrate drugs, but may respond to another. Furthermore, they may or may not 

15 respond to other treatments, including, for example, use of devices for treatment of vascular 
U disease, or surgical and^r non-surgical courses of treatment. Moreover, if a subject does or 

Z does not exhibit symptoms of a particular disease, the subject may or may not benefit fi-om 

"2 fiirther diagnostic evaluation, including, for example, use of vascular imaging devices. 

Thus, generation of a THBS2, ACE, or FGB genetic profile, (e,g,, categorization of 
20 alterations in THBS2, ACE, or FGB genes which are associated v^th the development of a 
particular disease), from a population of subjects, who are symptomatic for a disease or 
condition that is caused by or contributed to by a defective and/or deficient THBS2, ACE, or 
FGB gene and/or protein (a THBS2, ACE, or FGB genetic population profile) and 
comparison of a subject's THBS2, ACE, or FGB profile to the population profile, permits the 
25 selection or design of drugs that are expected to be safe and efficacious for a particular 

subject or subject population (i.e,, a group of subjects having the same genetic alteration), as 
well as the selection or design of a particular clinical course of therapy or further diagnostic 
evaluations that are expected to be safe and efficacious for a particular subject or subject 
population. 
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For example, a THBS2, ACE, or FOB population profile can be performed by 
determining the THBS2, ACE, or FGB profile, e.g., the identity of THBS2, ACE, or FOB 
alleles, in a subject population having a disease, which is associated with one or more 
specific alleles of THBS2, ACE, or FGB polymorphic regions. Optionally, the THBS2, 
5 ACE, or FGB population profile can fiirther include information relating to the response of 
the population to a THBS2, ACE, or FGB therapeutic, using any of a variety of methods, 
including, monitoring: 1) the severity of symptoms associated with the THBS2, ACE, or 
FGB related disease; 2) THBS2, ACE, or FGB gene expression level; 3) THBS2, ACE, or 
FGB mRNA level; and/or 4) THBS2, ACE, or FGB protein level, and dividing or 

1 0 categorizing the population based on particular THBS2, ACE, or FGB alleles. The THBS2, 
ACE, or FGB genetic population profile can also, optionally, indicate those particular 
THBS2, ACE, or FGB alleles which are present in subjects that are either responsive or non- 
responsive to a particular therapeutic, clinical course of therapy, or diagnostic evaluation. 
This information or population profile, is then useful for predicting which individuals should 

15 respond to particular drugs, particular clinical courses of therapy, or diagnostic evaluations 
based on their individual THBS2, ACE, or FGB genetic profile. 

In a preferred embodiment, the THBS2, ACE, or FGB profile is a transcriptional or 
expression level profile and is comprised of determining the expression level of THBS2, 
ACE, or FGB proteins, alone or in conjunction with the expression level of other genes 

20 known to contribute to the same disease at various stages of the disease. 

Pharmacogenomic studies can also be performed using transgenic animals. For 
example, one can produce transgenic mice, e.g., as described herein, which contain a specific 
allelic variant of a THBS2, ACE, or FGB gene. These mice can be created, e.g, by replacing 
their wild-type THBS2, ACE, or FGB gene with an allele of the human THBS2, ACE, or 

25 FGB gene. The response of these mice to specific THBS2, ACE, or FGB particular 
therapeutics, clinical courses of treatment, and/or diagnostic evaluations can then be 
determined. 
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(i) Diagnostic Evaluation 

In one embodiment, the polymorphisms of the present invention are used to 
determine the most appropriate diagnostic evaluation and to determine whether or not a 
subject will benefit from further diagnostic evaluation. For example, if a subject has pattem 
5 1 or pattem 2 of the THBS2 SNPs, or the complements thereof, as described herein, that 

subject has a decreased risk for vascular disease. Likewise, if a subject has one copy of an A 
and one copy of a G at nucleotide residue 86408 of the ACE reference sequence GI 
13027555 (AG genotype), or the complement thereof, that subject is at a decreased risk for 
vascular disease. Likewise, if a subject has two copies of a T at nucleotide residue 51 19 of 
10 the FGB reference sequence GI 182597, or the complement thereof, that subject is at a 
decreased risk for vascular disease. In addition, if a subject has one copy of a T and one 
copy of a C at nucleotide residue 51 19 of the FGB reference sequence GI 182597, or the 
complement thereof, that subject is also at a decreased risk for vascular disease. Therefore, a 
subject having a decreased risk for vascular disease, identified by the presence of the alleles 
15 described above, would be less likely to require or benefit from further diagnostic evaluation 
for a vascular disease or disorder. 

Thus, in one embodiment, the invention provides methods for classifying a subject 
who or is or is not at risk for developing, a vascular disease or disorder as a candidate for 
further diagnostic evaluation for a vascular disease or disorder comprising the steps of 
20 determining the THBS2, ACE, and/or FGB genetic profile of the subject, comparing the 

subject's THBS2, ACE, and/or FGB genetic profile to a THBS2, ACE, and/or FGB genetic 
population profile, and classifying the subject based on the identified genetic profiles as a 
subject who is a candidate for further diagnostic evaluation for a vascular disease or disorder. 
In one embodiment, the subject's THBS2, ACE, and/or FGB genetic profile is 
25 determined by identifying the nucleotide at residue 3949 and/or residue 4476 of the reference 
sequence GI 307505 of tiie THBS2 gene (polymorphism ID Nos. G5755e5 and G5755e9, 
respectively), the nucleotide at residue 86408 of the reference sequence GI 13027555 of the 
ACE gene (polymorphism ID No. G765u2), the nucleotide at residue 5119 and/or residue 
8059 of the reference sequence GI 182597 of the FGB gene (polymorphism ID Nos. FGBul 
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and FGBu4, respectively). Methods of further diagnostic evaluation include use of vascular 
imaging devices such as, for example, angiography, cardiac ultrasound, coronary angiogram, 
magnetic resonance imagery, nuclear imaging, CT scan, myocardial perfusion imagery, or 
electrocardiogram, or may include genetic analysis, familial health history analysis, lifestyle 
5 analysis, exercise stress tests, or any combination thereof. 

In another embodiment, the invention provides methods for selecting an effective 
vascular imaging device as a diagnostic tool for a vascular disease or disorder comprising the 
steps of determining the THBS2, ACE, and/or FGB genetic profile of the subject; comparing 
the subject's THBS2, ACE, and/or FGB genetic profile to a THBS2, ACE, and/or FGB 
10 genetic population profile; and selecting an effective vascular imaging device as a diagnostic 
Q tool for a vascular disease or disorder. In a preferred embodiment, the vascular imaging 

i!i device is selected from the group consisting of angiography, cardiac ultrasoxmd, coronary 

angiogram, magnetic resonance imagery, nuclear imaging, CT scan, myocardial perfusion 
I U imagery, electrocardiogram, or any combination thereof. 

15 

'::7s (ii) Clinical Coxxrse of Therapy 

'^f In another aspect, the polymorphisms of the present invention are used to determine 

□ the most appropriate clinical course of therapy for a subject who is at risk of a vascular 

' " * disease or disorder, and will aid in the determination of whether the subject will benefit from 

20 such clinical course of therapy, as determined by identification of one or both of the 
polymorphisms of the invention. 

In one aspect, the invention relates to the SNPs identified as described herein, both 
singly or in combination, as well as to the use of these SNPs, and others in these genes, 
particularly those nearby in linkage disequilibrium with these SNPs, both singly and in 
25 combination, for prediction of a particular clinical course of therapy for a subject who has, or 
is or is not at risk for developing, a vascular disease. In one embodiment, the invention 
provides a method for determining whether a subject will or will not benefit from a particular 
course of therapy by determining the presence of one, or both of the identities of the 
polymorphisms of the invention. For example, the determination of the polymorphisms of 
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the invention, singly, or in combination, will aid in the determination of whether a subject 
will benefit from surgical revascularization and/or will benefit by the implantation of a stent 
following sxirgicai revascularization, and v^U aid in the determination of the likelihood of 
success or failure of a particular clinical course of therapy. 
5 For example, a subject having "pattern 1," which comprises two copies of the variant 

allele of G5755e9 (CC) in combination with two copies of the reference allele of G5755e5 
(TT), or the complement thereof, or "pattern T\ which comprises two copies of the reference 
allele of G5755e9 (TT) and two copies of the variant allele of G5755e5 (GG), or the 
complement thereof, is at approximately 3-fold decreased odds of vascular disease. 

10 A subject having one copy of an A and one copy of a G at nucleotide residue 86408 

of the ACE reference sequence GI 13027555 (AG genotype), or the complement thereof, is 
at a decreased risk for vascular disease. 

A subject having two copies of a T at nucleotide residue 5119 of the FGB reference 
sequence GI 182597, or the complement thereof, is at a decreased risk for vascular disease, 

15 and a subject having one copy of a T and one copy of a C at nucleotide residue 51 19 of the 
FGB reference sequence GI 182597, or the complement thereof, is also at a decreased risk 
for vascular disease. Also, a subject having two copies of an A at nucleotide residue 8059 of 
the FGB reference sequence GI 182597, or the complement thereof, is at a decreased risk for 
vascular disease. A subject having one copy of an A and one copy of a G at nucleotide 

20 residue 5 11 9 of the FGB reference sequence GI 1 82597, or the complement thereof, is also at 
a decreased risk for vascular disease (see Example 1). Therefore, a subject with these 
specific alleles would be less likely to require or benefit from any clinical course of therapy. 

An appropriate clinical course of therapy may include, for example, a lifestyle 
change, including, for example, a change in diet or environment. Other clinical courses of 

25 therapy include, but are not limited to, use of surgical procedures or medical devices. 
Surgical procedures used for the treatment of vascular disorders, includes, for example, 
surgical revascularization, such as angioplasty, e.g,, percutaneous transluminal coronary 
balloon angioplasty (PTCA), or laser angioplasty, or coronary bypass grafting (CABG). 
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Medical devices used in the treatment or prevention of vascular diseases or disorders, 
include, for example, a stent, a defibrillator, a pacemaker, or any combination thereof. 

C. Monitoring Effects of THBS2, ACE, or FGB Therapeutics DvirinR Clinical Trials 
5 The present invention provides a method for monitoring the effectiveness of 

treatment of a subject with an agent {e,g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate identified, e.g., by the 
screening assays described herein) comprising the steps of (i) obtaining a preadministration 
sample fi-om a subject prior to administration of the agent; (ii) detecting the level of 

1 0 expression or activity of a THBS2, ACE, or FGB protein, mRNA or gene in the 

preadministration sample; (iii) obtaining one or more post-administration samples from the 
subject; (iv) detecting the level of expression or activity of the THBS2, ACE, or FGB 
protein, mRNA or gene in the post-administration samples; (v) comparing the level of 
expression or activity of the THBS2, ACE, or FGB protein, mRNA, or gene in the 

1 5 preadministration sample with those of the THBS2, ACE, or FGB protein, mRNA, or gene in 
the post administration sample or samples; and (vi) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be desirable 
to increase the expression or activity of THBS2, ACE, or FGB to higher levels than detected, 
z.e., to increase the effectiveness of the agent. Alternatively, decreased administration of the 

20 agent may be desirable to decrease expression or activity of THBS2, ACE, or FGB to lower 
levels than detected, i.e., to decrease the effectiveness of the agent. 

Cells of a subject may also be obtained before and after administration of a THBS2, 
ACE, or FGB therapeutic to detect the level of expression of genes other than THBS2, ACE, 
or FGB, to verify that the THBS2, ACE, or FGB therapeutic does not increase or decrease 

25 the expression of genes which could be deleterious. This can be done, e.g., by using the 
method of transcriptional profiling. Thus, mRNA from cells exposed in vivo to a THBS2, 
ACE, or FGB therapeutic and mRNA from the same type of cells that were not exposed to 
the THBS2, ACE, or FGB therapeutic could be reverse transcribed and hybridized to a chip 
containing DNA from numerous genes, to thereby compare the expression of genes in cells 
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treated and not treated with a THBS2, ACE, or FGB therapeutic. If, for example a THBS2, 
ACE, or FGB therapeutic turns on the expression of a proto-oncogene in a subject, use of this 
particular THBS2, ACE, or FGB therapeutic may be undesirable. 

5 D. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject having or likely to develop a disorder associated with specific THBS2, 
ACE, or FGB alleles and/or aberrant THBS2, ACE, or FGB expression or activity, e.g., 
vascular diseases or disorders. 

U 10 

;;5 i) Prophylactic Methods 

In one aspect, the invention provides a method for preventhig a disease or disorder 
%J associated with a specific THBS2, ACE, or FGB allele such as a vascular disease or disorder, 

e.g., CAD or MI, and medical conditions resulting therefrom, by administering to the subject 
J" 15 an agent which counteracts the unfavorable biological effect of the specific THBS2, ACE, or 

[U FGB allele. Subjects at risk for such a disease can be identified by a diagnostic or prognostic 

,p assay, e.g., as described herein. Administration of a prophylactic agent can occur prior to the 

i t; manifestation of symptoms associated with specific THBS2, ACE, or FGB alleles, such that 

a disease or disorder is prevented or, altematively, delayed in its progression. Depending on 
20 the identity of the THBS2, ACE, or FGB allele in a subject, a compound that counteracts the 
effect of this allele is administered. The compound can be a compound modulating the 
activity of THBS2, ACE, or FGB, e.g., a THBS2, ACE, or FGB inhibitor. The treatment can 
also be a specific lifestyle change, e.g., a change in diet or an environmental alteration. In 
particular, the treatment can be undertaken prophylactically, before any other symptoms are 
25 present. Such a prophylactic treatment could thus prevent the development of aberrant 

vascular activity, e.g, the production of atherosclerotic plaque leading to, e,g, CAD or ML 
The prophylactic methods are similar to therapeutic methods of the present invention and are 
further discussed in the following subsections. 
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(ii) Therapeutic Methods 

The invention further provides methods of treating a subject having a disease or 
disorder associated with a specific allelic variant of a polymorphic region of a THBS2, ACE, 
or FGB gene. Preferred diseases or disorders include vascular diseases and disorders, and 
5 disorders resulting therefrom (e.g., such as, for example, atherosclerosis, CAD, MI, ischemia, 
stroke, peripheral vascular diseases, venous thromboembolism and pulmonary embolism). 

In one embodiment, the method comprises (a) determining the identity of an allelic 
variant of a one or more of a THBS2, ACE, and/or FGB; and (b) administering to the subject 
a compound that compensates for the effect of the specific allelic variant(s). The 
10 polymorphic region can be localized at any location of the gene, e.g., in a regulatory element 
(e.g., in a 5' upstream regulatory element), in an exon, (e.g., coding region of an exon), in the 
3' UTR, in an intron, or at an exon/intron border. Thus, depending on the site of the 
polymorphism in the THBS2, ACE, or FGB gene, a subject having a specific variant of the 
polymorphic region which is associated with a specific disease or condition, can be treated 
15 with compovmds which specifically compensate for the effect of the allelic variant. 

In a preferred embodiment, the identity of one or more of the following nucleotides 
of a THBS2, ACE, or FGB gene of a subject is determined: the nucleotide at residue 3949 
and/or residue 4476 of the reference sequence GI 307505 of the THBS2 gene (polymorphism 
ID Nos. G5755e5 and G5755e9, respectively), the nucleotide at residue 86408 of the 
20 reference sequence GI 13027555 of the ACE gene (polymorphism ID No. G765u2), the 
nucleotide at residue 5119 and/or residue 8059 of the reference sequence GI 182597 of the 
FGB gene (polymorphism ID Nos. FGBul and FGBu4, respectively). In a preferred 
embodiment, the identities of one or more nucleotides is determined. 

For example, a subject having "pattem 1," which comprises two copies of the variant 
25 allele of G5755e9 (CC) in combination with two copies of the reference allele of G5755e5 
(TT), or the complement thereof, or "pattern 2", which comprises two copies of the reference 
allele of G5755e9 (TT) and two copies of the variant allele of G5755e5 (GG), or the 
complement thereof, is at approximately 3-fold decreased odds of vascular disease. 

A subject having one copy of an A and one copy of a G at nucleotide residue 86408 
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of the ACE reference sequence GI 13027555 (AG genotype), or the complement thereof, is 
at a decreased risk for vascular disease. 

A subject having two copies of a T at nucleotide residue 51 19 of the FGB reference 
sequence GI 182597, or the complement thereof, is at a decreased risk for vascular disease, 
5 and a subject having one copy of a T and one copy of a C at nucleotide residue 51 19 of the 
FGB reference sequence GI 182597, or the complement thereof, is also at a decreased risk 
for vascular disease. Also, a subject having two copies of an A at nucleotide residue 8059 of 
the FGB reference sequence GI 182597, or the complement thereof, is at a decreased risk for 
vascular disease. A subject having one copy of an A and one copy of a G at nucleotide 

1 0 residue 5 1 1 9 of the FGB reference sequence GI 1 82597, or the complement thereof, is also at 
a decreased risk for vascular disease. 

Generally, the allelic variant can be a mutant allele, le., an allele which when 
present in one, or two copies, in a subject results in a change in the phenotype of the subject. 
A mutation can be a substitution, deletion, and/or addition of at least one nucleotide relative 

15 to the wild-type allele {i.e., the reference sequence). Depending on where the mutation is 
located in the THBS2, ACE, or FGB gene, the subject can be treated to specifically 
compensate for the mutation. For example, if the mutation is present in the coding region of 
the gene and results in a more active THBS2, ACE, or FGB protein, the subject cm be 
treated, e.g., by administration to the subject of a medication or course of clinical treatment 

20 which treat, prevents, or ameliorates a vascular disease or disorder. Normal THBS2, ACE, 
or FGB protein can also be used to counteract or compensate for the endogenous mutated 
form of the THBS2, ACE, or FGB protein. Normal THBS2, ACE, or FGB protein can be 
directly delivered to the subject or indirectly by gene therapy wherein some cells in the 
subject are transformed or transfected with an expression construct encoding wild-type 

25 THBS2, ACE, or FGB protein. Nucleic acids encoding reference human THBS2, ACE, or 
FGB protein are set forth in SEQ ID NOs.:l, 3, and 5, respectively (GI Accession Nos. 
307505, 13027555, and 182597, respectively). 

Yet in another embodiment, the invention provides methods for treating a subject 
having a mutated THBS2, ACE, or FGB gene, in which the mutation is located in a 
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regulatory region of the gene. Such a regulatory region can be localized in the 5' upstream 
regulatory element of the gene, in the 5' or 3' untranslated region of an exon, or in an intron. 
A mutation in a regulatory region can result in increased production of THBS2, ACE, or 
FGB protein, decreased production of THBS2, ACE, or FOB protein, or production of 
5 THBS2, ACE, or FGB having an aberrant tissue distribution. The effect of a mutation in a 
regulatory region upon the THBS2, ACE, or FGB protein can be determined, e.g., by 
measuring the THBS2, ACE, or FGB protein level or mRNA level in cells having a THBS2, 
ACE, or FGB gene having this mutation and which, normally (i.e., in the absence of the 
mutation) produce THBS2, ACE, or FGB protein. The effect of a mutation can also be 

10 determined in vitro. For example, if the mutation is in the 5' upstream regulatory element, a 
reporter construct can be constructed which comprises the mutated 5' upstream regxilatory 
element linked to a reporter gene, the construct transfected into cells, and comparison of the 
level of expression of the reporter gene under the control of the mutated 5' upstream 
regulatory element and under the control of a wild-type 5' upstream regulatory element. 

15 Such experiments can also be carried out in mice transgenic for the mutated 5' upstream 

regulatory element. If the mutation is located in an intron, the effect of the mutation can be 
determined, e.g., by producing transgenic animals in which the mutated THBS2, ACE, or 
FGB gene has been introduced and in which the wild-type gene may have been knocked out. 
Comparison of the level of expression of THBS2, ACE, or FGB in the mice transgenic for 

20 the mutant human THBS2, ACE, or FGB gene with mice transgenic for a v^ld-type human 
THBS2, ACE, or FGB gene will reveal whether the mutation results in increased, or 
decreased synthesis of the THBS2, ACE, or FGB protein and/or aberrant tissue distribution 
of THBS2, ACE, or FGB protein. Such analysis could also be performed in cultured cells, in 
which the human mutant THBS2, ACE, or FGB gene is introduced and, e.g., replaces the 

25 endogenous wild-type THBS2, ACE, or FGB gene in the cell. Thus, depending on the effect 
of the mutation in a regulatory region of a THBS2, ACE, or FGB gene, a specific treatment 
can be administered to a subject having such a mutation. Accordingly, if the mutation results 
in increased THBS2, ACE, or FGB protein levels, the subject can be treated by 
administration of a compound which reduces THBS2, ACE, or FGB protein production, e.g., 
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by reducing THBS2, ACE, or FGB gene expression or a compound which inhibits or reduces 
the activity of THBS2, ACE, or FGB. 

A correlation between drug responses and specific alleles of THBS2, ACE, or FGB 
can be shown, for example, by clinical studies wherein the response to specific drugs of 
5 subjects having different allelic variants of a polymorphic region of a THBS2, ACE, or FGB 
gene is compared. Such studies can also be performed using animal models, such as mice 
having various alleles of human THBS2, ACE, or FGB genes and in which, e.g., the 
endogenous THBS2, ACE, or FGB has been inactivated such as by a knock-out mutation. 
Test drugs are then administered to the mice having different human THBS2, ACE, or FGB 
10 alleles and the response of the different mice to a specific compound is compared. 

Accordingly, the invention provides assays for identifying the drug which will be best suited 
for treating a specific disease or condition in a subject. For example, it will be possible to 
select drugs which will be devoid of toxicity, or have the lowest level of toxicity possible for 
treating a subject having a disease or condition. 

15 

Other Uses For the Nucleic Acid Molecules of the Invention 

The identification of different alleles of THBS2, ACE, or FGB can also be useful for 
identifying an individual among other individuals from the same species. For example, DNA 
sequences can be used as a fingerprint for detection of different individuals within the same 
20 species (Thompson, J. S. and Thompson, eds.. Genetics in Medicine, WB Saunders Co., 
Philadelphia, PA (1991)). This is useful, for example, in forensic studies and paternity 
testing, as described below. 



A. Forensics 

25 Determination of which specific allele occupies a set of one or more polymorphic 

sites in an individual identifies a set of polymorphic forms that distinguish the individual 
from others in the population. See generally National Research Council, The Evaluation of 
Forensic DNA Evidence (Eds. Pollard ef a/.. National Academy Press, DC, 1996). The more 
polymorphic sites that are analyzed, the lower the probability that the set of polymorphic 
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forms in one individual is the same as that in an unrelated individual. Preferably, if multiple 
sites are analyzed, the sites are unlinked. Thus, the polymorphisms of the invention can be 
used in conjunction with known polymorphisms in distal genes. Preferred polymorphisms 
for use in forensics are biallelic because the population frequencies of two polymorphic 
forms can usually be determined with greater accuracy than those of multiple polymorphic 
forms at multi-allelic loci. 

The capacity to identify a distinguishing or xmique set of polymorphic markers in an 
individual is useful for forensic analysis. For example, one can determine whether a blood 
sample from a suspect matches a blood or other tissue sample from a crime scene by 
determining whether the set of polymorphic forms occupymg selected polymorphic sites is 
the same in the suspect and the sample. If the set of polymorphic markers does not match 
between a suspect and a sample, it can be concluded (barring experimental error) that the 
suspect was not the source of the sample. If the set of markers is the same in the sample as in 
the suspect, one can conclude that the DNA from the suspect is consistent with that found at 
the crime scene. If frequencies of the polymorphic forms at the loci tested have been 
determined (e.g., by analysis of a suitable population of individuals), one can perform a 
statistical analysis to determine the probability that a match of suspect and crime scene 
sample would occur by chance. 

p(ID) is the probability that two random individuals have the same polymorphic or 
allelic form at a given polymorphic site. For example, hi biallelic loci, four genotypes are 
possible: AA, AB, BA, and BB. If alleles A and B occur in a haploid genome of the 
organism with frequencies x and y, the probability of each genotype in a diploid organism is 
(see WO 95/12607): 

Homozygote: p(AA)= x^ 

Homozygote: p(BB)= f = (1-x)^ 

Single Heterozygote: p(AB)== p(BA)= xy = x(l-x) 

Both Heterozygotes: p(AB+BA)= 2xy = 2x(l-x) 



The probability of identity at one locus (i.e., the probability that two individuals, 
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picked at random from a population will have identical polymorphic forms at a given locus) 
is given by the equation: p(ID) = (x^). 

These calculations can be extended for any number of polymorphic forms at a given 
locus. For example, the probability of identity p(ID) for a 3-allele system where the alleles 
5 have the frequencies in the population of x, y, and z, respectively, is equal to the sum of the 
squares of the genotype frequencies: P(ID) = x"* + (2xy) ^ + (2yz) ^ + (2xz) ^ + z"^ + y"^. 

In a locus of n alleles, the appropriate binomial expansion is used to calculate p(ID) 
and p(exc). 

The cumulative probability of identity (cum p(ID)) for each of multiple unlinked loci 
10 is determined by multiplying the probabilities provided by each locus: 

cum p(ID) = p(IDl)p(ID2)p(ID3). . .p(IDn). 

The cumulative probability of non-identity for n loci (i.e., the probability that two 

random individuals will be difference at 1 or more loci) is given by the equation: 

cum p(nonlD) = 1-cum p(ID). 
15 If several polymorphic loci are tested, the cumulative probability of non-identity for 

random individuals becomes very high (e.g., one billion to one). Such probabilities can be 

taken into accoimt together with other evidence in determining the guilt or innocence of the 

suspect. 



20 B. Paternity Testing 

The object of paternity testing is usually to determine whether a male is the father of 
a child. In most cases, the mother of the child is known, and thus, it is possible to trace the 
mother's contribution to the child's genotype. Patemity testing investigates whether the part 
of the child's genotype not attributable to the mother is consistent to that of the putative 

25 father. Patemity testing can be performed by analyzing sets of polymorphisms in the 
putative father and in the child. 

If the set of polymorphisms in the child attributable to the father does not match the 
set of polsonorphisms of the putative father, it can be concluded, barring experimental error, 
that that putative father is not the real father. If the set of polymorphisms in the child 
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attributable to the father does match the set of polymorphisms of the putative father, a 
statistical calculation can be performed to determine the probability of a coincidental match. 

The probability of parentage exclusion (representing the probability that a random 
male will have a polymorphic form at a given polymorphic site that makes him incompatible 

5 as the father ) is given by the equation (see WO 95/12607): p(exc) = xy(l-xy), where x and y 
are the population frequencies of alleles A and B of a biallelic polymorphic site. 

(At a triallelic site p(exc) = xy(l-xy) + yz(l-yz) + xz(l-xz) + 3xyz(l-xyz)), where x, 
y, and z and the respective populations frequencies of alleles A, B, and C). 
The probability of non-exclusion is: p(non-exc) = l-p(exc), 
10 The cumulative probability of non-exclusion (representing the values obtained when 

n loci are is used) is thus: 

Cum p(non-exc) = p(non-excl)p(non-exc2)p(non-exc3). . .p(non-excn). 
The cumulative probability of the exclusion for n loci (representing the probability 
that a random male will be excluded: cum p(exc) = 1 - cum p(non-exc), 
15 If several polymorphic loci are included in the analysis, the cumulative probability of 

exclusion of a random male is very high. This probability can be taken into account in 
assessing the liability of a putative father whose polymorphic marker set matches the child's 
polymorphic marker set attributable to his or her father. 

20 C. Kits 

As set forth herein, the invention provides methods, e,g., diagnostic and therapeutic 
methods, e,g., for determining the type of allelic variant of a polymorphic region present in a 
THBS2, ACE, or FGB gene, such as a human THBS2, ACE, or FOB gene. In preferred 
embodiments, the methods use probes or primers comprising nucleotide sequences which are 
25 complementary polymorphic region of a THBS2, ACE, or FGB gene (SBQ ID N0s:5, 6, 7, 
8, 9, 10, and 11). Accordingly, the invention provides kits for performing these methods. 

In a preferred embodiment, the invention provides a kit for determining whether a 
subject is or is not at risk of developing a disease or condition associated with a specific 
allelic variant of a THBS2, ACE, or FGB polymorphic region. In an even more preferred 
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embodiment, the disease or disorder is characterized by an abnormal THBS2, ACE, or FGB 
activity. In an even more preferred embodiment, the invention provides a kit for determining 
whether a subject is or is not at risk of developing a vascular disease, e.g., atherosclerosis, 
CAD, MI, ischemia, stroke, peripheral vascular diseases, venous thromboembolism and 
5 pulmonary embolism. 

A preferred kit provides reagents for determining whether a subject is or is not likely 
to develop a vascular disease, e.g., CAD or MI. 

Preferred kits comprise at least one probe or primer which is capable of specifically 
hybridizing under stringent conditions to a THBS2, ACE, or FGB reference sequence or 
10 polymorphic region and instructions for use. The kits preferably comprise at least one of the 
above described nucleic acids. Preferred kits for amplifying at least a portion of a THBS2, 
ACE, or FGB gene, comprise at least one primer pair which is capable of hybridizing to an 
■'4 allelic variant sequence of a THBS2, ACE, or FGB gene. The kits of the invention can also 

'si. J 

I'll comprise one or more control nucleic acids or reference nucleic acids. For example, a kit can 

1 5 comprise primers for amplifying a polymorphic region of a THBS2, ACE, or FGB gene and 

i'^^' a control DNA corresponding to such an amplified DNA and having the nucleotide sequence 

I'll 

of a specific allelic variant. Thus, direct comparison can be performed between the DNA 
% amplified from a subject and the DNA having the nucleotide sequence of a specific allelic 

■1 

variant. In one embodiment, the control nucleic acid comprises at least a portion of a 
20 THBS2, ACE, or FGB gene of an individual who does not have a vascular disease, or a 
disease or disorder associated vrfth an aberrant THBS2, ACE, or FGB activity. In another 
embodiment, the control nucleic acid comprises at least a portion of a THBS2, ACE, or FGB 
gene of an individual who does have a vascular disease, or a disease or disorder associated 
with an aberrant THBS2, ACE, or FGB activity. In yet another embodiment, the control 
25 nucleic acid comprises a reference sequence of a THBS2, ACE, or FGB gene. 

Yet other kits of the invention comprise at least one reagent necessary to perform the 
assay. For example, the kit can comprise an enzyme. Alternatively the kit can comprise a 
buffer or any other necessary reagent. 



-75- 



Atty. Docket No.: MMI-004 



D. Electronic Apparatus Readable Media and Arrays 

Electronic apparatus readable media comprising a polymorphism of the present 
invention is also provided. As used herein, "electronic apparatus readable media" and 
"computer readable media," which are used interchangeably herein, refer to any suitable 
5 medium for storing, holding or containing data or information that can be read and accessed 
directly by an electronic apparatus. Such media can include, but are not limited to: magnetic 
storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical 
storage media such as compact disc; electronic storage media such as RAM, ROM, EPROM, 
EEPROM and the like; general hard disks and hybrids of these categories such as 
10 magnetic/optical storg^e media. The medium is adapted or configured for having recorded 
Q thereon a polymorphism of the present invention. 

•ps: As used herein, the term "electronic apparatus" is intended to include any suitable 

1 computing or processing apparatus or other device configured or adapted for storing data or 

' information. Examples of electronic apparatus suitable for use with the present invention 

u 15 include stand-alone computing apparatus; networks, including a local area network (LAN), a 
i ll wide area network (WAN) Internet, Intranet, and Extranet; electronic appliances such as a 

^'l! personal digital assistants (PDAs), cellular phone, pager and the like; and local and 

Q distributed processing systems. 

As used herein, "recorded" refers to a process for storing or encoding information on 
20 the electronic apparatus readable medium. Those skilled in the art can readily adopt any of 
the presently known methods for recording information on known media to generate 
manufactures comprising the polymorphisms of the present invention. 

A variety of software programs and formats can be used to store the polymorphism 
information of the present invention on the electronic apparatus readable medium. For 
25 example, the polymorphic sequence can be represented in a word processing text file, 

formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, 
Sybase, Oracle, or the like, as well as in other forms. Any number of data processor 
structuring formats (e.g., text file or database) may be employed in order to obtain or create a 
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medium having recorded thereon the markers of the present invention. 

By providing the polymorphisms of the invention in readable form, one can routinely 
access the polymorphism information for a variety of purposes. For example, one skilled in 
the art can use the sequences of the polymorphisms of the present invention in readable form 
5 to compare a target sequence or target structural motif with the sequence information stored 
within the data storage means. Search means are used to identify fragments or regions of the 
sequences of the invention which match a particular target sequence or target motif. 

The present invention therefore provides a medium for holding instructions for 
performing a method for determining whether or not a subject has a vascular disease or a pre- 
10 disposition to a vascular disease, wherein the method comprises the steps of determining the 
i; 3 presence or absence of a polymorphism and based on the presence or absence of the 

■/I polymorphism, determining whether the subject has a vascular disease or a pre-disposition to 

a vascular disease and/or recommending a particular clinical course of therapy or diagnostic 
\U evaluation for the vascular disease or pre-vascular disease condition. 

J, IS The present invention further provides in an electronic system comprising a processor 

j^J j and/or in a network, a method for determining whether or not a subject has a vascular disease 

f; or a pre-disposition to vascular disease associated with a polymorphism as described herein 

i;;3 wherein the method comprises the steps of determining the presence or absence of the 

polymorphism, and based on the presence or absence of the polymorphism, determining 
20 whether the subject has a vascular disease or a pre-disposition to a vascular disease, suid/or 
recommending a particular treatment for the vascular disease or pre-vascular disease 
condition. In one embodiment, the processor implements the functionality of obtaining 
information from the subject indicative of the presence or absence of the polymorphic region. 
In another embodiment, the processor further implements the functionality of receiving 
25 phenotypic information associated with the subject. In yet another embodiment, the 
processor further implements the functionality of acquiring from a network phenotypic 
information associated with the subject. The method may further comprise the step of 
receiving phenotypic information associated with the subject and/or acquiring from a 
network phenotypic information associated with the subject. 
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The present invention also provides in a network, a method for determining whether 
or not a subject has vascular disease or a pre-disposition to vascular disease associated with a 
polymorphism, said method comprising the steps of receiving information associated with 
the polymorphism, receiving phenotypic information associated with the subject, acquiring 
5 information from the network corresponding to the polymorphism and/or vascular disease, 
and based on one or more of the phenotypic information, the polymorphism, and the acquired 
information, determining whether or not the subject has a vascular disease or a pre- 
disposition to a vascular disease. The method may further comprise the step of 
recommending a particular treatment for the vascular disease or pre-vascular disease 
10 condition. 

The present invention also provides a method for determining whether or not a 
subject has a vascular disease or a pre-disposition to a vascular disease, said method 
comprising the steps of receiving information associated with the polymorphism, receiving 
phenotypic information associated with the subject, acquiring information from the network 
15 corresponding to the polymorphism and/or vascular disease, and based on one or more of the 
phenotypic information, the polymorphism, and the acquired information, determining 
whether the subject has vascular disease or a pre-disposition to vascular disease. The method 
may further comprise the step of recommending a particular treatment for the vascular 
disease or pre-vascular disease condition. 

20 

E. Personalized Health Assessment 

Methods and systems of assessing personal health and risk for disease, e.g., vascular 
disease, in a subject, using the polymorphisms and associations of the instant invention are 
also provided. The methods provide personalized health care knowledge to individuals as 
25 well as to their health care providers, as well as to health care companies. It will be 

appreciated that the term '"health care providers" is not limited to physicians but can be any 
source of health care. The methods and systems provide personalized information including 
a personal health assessment report that can include a personalized molecular profile, e,g., a 
THBS2, ACE, and/or FGB genetic profile, a health profile, or both. Overall, the methods 
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and systems as described herein provide personalized information for individuals and patient 
management tools for healthcare providers and/or subjects using a variety of 
communications networks such as, for example, the Internet. U.S. Patent Application Serial 
No. 60/266,082, filed February 1, 2001, entitled "Methods and Systems for Personalized 
5 Health Assessment," further describes personalized health assessment methods, systems, and 
apparatus, and is expressly incorporated herein by reference. 

In one aspect, the invention provides an Internet-based method for assessing a 
subject's risk for vascular disease, e.g., CAD or ML In one embodiment, the method 
comprises obtaining information from the subject regarding the polymorphic region of an F7 
10 gene, through e.g., obtaining a biological sample from a subject, analyzing the biological 
i;l sample to determine the presence or absence of a polymorphic region of THBS2, ACE, 

j''! and/or FOB, and providing results of the analysis to the subject via the Internet, wherein the 

presence of a polymorphic region of THBS2, ACE, and/or FGB indicates a decreased risk for 
ry vascular disease. In another embodiment, the method comprises analyzing data from a 

./ 1 5 biological sample from a subject relating to the presence or absence of a polymorphic region 

i . . " 

of THBS2, ACE, and/or FGB and providing results of the analysis to the subject via the 
i f Internet, wherein the presence of a pols^norphic region of THBS2, ACE, and/or FGB 

i;3 indicates an a decreased risk for vascular disease. 

It will be appreciated that the phrase "wherein the presence of a polymorphic region 
20 of THBS2, ACE, and/or FGB indicates a decreased risk for vascular disease" includes a 
subject having "pattern 1," which comprises two copies of the variant allele of G5755e9 
(CC) in combination with two copies of the reference allele of G5755e5 (TT), or the 
complement thereof, or "pattern 2", which comprises two copies of the reference allele of 
G5755e9 (TT) and two copies of the variant allele of G5755e5 (GG), or the complement 
25 thereof, which indicates that the subject is at approximately 3 -fold decreased odds of having 
or developing a vascular disease. This phrase also includes a subject having one copy of an 
A and one copy of a G at nucleotide residue 86408 of the ACE reference sequence GI 
13027555 (AG genotype), or the complement thereof, which indicates that the subject is at a 
decreased risk for having or developing a vascular disease. This phrase also includes a 
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subject having two copies of a T at nucleotide residue 5 1 1 9 of the FGB reference sequence 
GI 182597, or the complement thereof, which indicates that the subject is at a decreased risk 
for having or developing a vascular disease, and a subject having one copy of a T and one 
copy of a C at nucleotide residue 51 19 of the FGB reference sequence GI 182597, or the 
5 complement thereof, which indicates that the subject is also at a decreased risk for having or 
developing a vascular disease. Also, a subject having two copies of an A at nucleotide 
residue 8059 of the FGB reference sequence GI 182597, or the complement thereof, indicates 
that the subject is at a decreased risk for having or developing a vascular disease. A subject 
having one copy of an A and one copy of a G at nucleotide residue 51 19 of the FGB 
10 reference sequence GI 1 82597, or the complement thereof, indicates that the subject is also at 
;3 a decreased risk for having or developing a vascular disease (see Example 1), 

|.,L The terms '"Internet" and/or "communications network" as used herein refer to any 

J suitable communication link, which permits electronic communications. It should be 

imderstood that these terms are not limited to "the Intemet" or any other particular system or 
ii 15 type of communication link. That is, the terms "Intemet" and/or "communications network" 

refer to any suitable communication system, including extra-computer system and intra- 
computer system communications. Examples of such communication systems include 
i;3 intemal busses, local area networks, wide area networks, point-to-point shared and dedicated 

communications, infra-red links, microwave links, telephone links, CATV links, satellite and 
20 radio links, and fiber-optic links. The terms "Intemet" and/or "communications network" 
can also refer to any suitable communications system for sending messages between remote 
locations, directly or via a third party communication provider such as AT&T. In this 
instance, messages can be communicated via telephone or facsimile or computer synthesized 
voice telephone messages with or without voice or tone recognition, or any other suitable 
25 communications technique. 

In another aspect, the methods of the invention also provide methods of assessing a 
subject's risk for vascular disease, e.g., CAD or MI. In one embodiment, the method 
comprises obtaining information from the subject regarding the polymorphic region of an F7 
gene, through e.g., obtaining a biological sample from the individual, analyzing the sample 
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to obtain the subject's THBS2, ACE, and/or FOB genetic profile, representing the THBS2, 
ACE, and/or FGB genetic profile information as digital genetic profile data, electronically 
processing the THBS2, ACE, and/or FGB digital genetic profile data to generate a risk 
assessment report for vascular disease, and displaying the risk assessment report on an output 
5 device, where the presence of a polymorphic region of THBS2, ACE, and/or FGB indicates a 
decreased risk for vascular disease. In another embodiment, the method comprises analyzing 
a subject's THBS2, ACE, and/or FGB genetic profile, representing the THBS2, ACE, and/or 
FGB genetic profile information as digital genetic profile data, electronically processing the 
THBS2, ACE, and/or FGB digital genetic profile data to generate a risk assessment report for 

1 0 vascular disease, and displaying the risk assessment report on an output device, where the 

presence of a polymorphic region of THBS2, ACE, and/or FGB indicates a decreased risk for 
vascular disease, e.g., CAD or MI. Additional health information may be provided and can 
be utilized to generate the risk assessment report. Such information includes, but is not 
limited to, information regarding one or more of age, sex, ethnic origin, diet, sibling health, 

1 5 parental health, clinical symptoms, personal health history, blood test data, weight, and 
alcohol use, drug use, nicotine use, and blood pressure. 

The THBS2, ACE, and/or FGB digital genetic profile data may be transmitted via a 
communications network, e.g., the Internet, to a medical information system for processing. 
In yet another aspect the invention provides a medical information system for 

20 assessing a subject's risk for vascular disease comprising a means for obtaining information 
from the subject regarding the polymorphic region of an F7 gene, through e.g., obtaining a 
biological sample from the individual to obtain a THBS2, ACE, and/or FGB genetic profile, 
a means for representing the THBS2, ACE, and/or FGB genetic profile as digital molecular 
data, a means for electronically processing the THBS2, ACE, and/or FGB digital genetic 

25 profile to generate a risk assessment report for vascular disease, and a means for displaying 
the risk assessment report on an output device, where the presence of a polymorphic region 
of THBS2, ACE, and/or FGB indicates a decreased risk for vascular disease. 

In another aspect, the invention provides a computerized method of providing 
medical advice to a subject comprising obtaining information from the subject regarding the 
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polymorphic region of an F7 gene, through e.g., obtaining a biological sample from the 
subject, analyzing the subject's biological sample to determine the subject's THBS2, ACE, 
and/or FGB genetic profile, and, based on the subject's THBS2, ACE, and/or FOB genetic 
profile, determining the subject's risk for vascular disease. Medical advice may be then 
5 provided electronically to the subject, based on the subject's risk for vascular disease. The 
medical advice may comprise, for example, recommending one or more of the group 
consisting of: further diagnostic evaluation, use of medical or surgical devices, 
administration of medication, or lifestyle change. Additional health information may also be 
obtained from the subject and may also be used to provide the medical advice. 
10 In another aspect, the invention includes a method for self-assessing risk for a 

1:3 vascular disease. The method comprises providing information from the subject regarding 

)!! the polymorphic region of an F7 gene, through e.g., providing a biological sample for genetic 

:, J analysis, and accessing an electronic output device displaying results of the genetic analysis, 

thereby self-assessing risk for a vascular disease, where the presence of a polymorphic region 
1 5 of THBS2, ACE, and/or FGB indicates a decreased risk for vascular disease, 
i'l I In another aspect, the invention provides a method of self-assessing risk for vascular 

disease comprising providing information from the subject regarding the polymorphic region 
i;3 of an F7 gene, through e.g., providing a biological sample, accessing THBS2, ACE, and/or 

FGB digital genetic profile data obtained from the biological sample, the THBS2, ACE, 
20 and/or FGB digital genetic profile data being displayed via an output device, where the 

presence of a polymorphic region of THBS2, ACE, and/or FGB indicates a decreased risk for 
vascular disease. 

An output device may be, for example, a CRT, printer, or website. An electronic 
output device may be accessed via the Internet. 
25 The biological sample may be obtained from the individual at a laboratory company. 

In one embodiment, the laboratory company processes the biological sample to obtain 
THBS2, ACE, and/or FGB genetic profile data, represents at least some of the THBS2, ACE, 
and/or FGB genetic profile data as digital genetic profile data, and transmits the THBS2, 
ACE, and/or FGB digital genetic profile data via a communications network to a medical 
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information system for processing. The biological sample may also be obtained from the 
subject at a draw station. A draw station processes the biological sample to obtain THBS2, 
ACE, and/or FGB genetic profile data and transfers the data to a laboratory company. The 
laboratory company then represents at least some of the THBS2, ACE, and/or FGB genetic 
5 profile data as digital genetic profile data, and transmits the THBS2, ACE, and/or FGB 
digital genetic profile data via a conrniunications network to a medical information system 
for processing. 

In another aspect, the invention provides a method for a health care provider to 
generate a personal health assessment report for an individual. The method comprises 

1 0 counseling the individual to provide a biological sample and authorizing a draw station to 
take a biological sample from the individual and transmit molecular information from the 
sample, to a laboratory company, where the molecular information comprises the presence or 
absence of a polymorphic region of THBS2, ACE, and/or FGB. The health care provider 
then requests the laboratory company to provide digital molecular data corresponding to the 

15 molecular information to a medical information system to electronically process the digital 
molecular data and digital health data obtained fi-om the individual to generate a health 
assessment report, receives the health assessment report from the medical information 
system, and provides the health assessment report to the individual. 

In still another aspect, the invention provides a method of assessing the health of an 

20 individual. The method comprises obtaining health information from the individual using an 
input device (e.g., a keyboard, touch screen, hand-held device, telephone, wireless input 
device, or interactive page on a website), representing at least some of the health information 
as digital health data, obtaining information from the subject regarding the polymorphic 
region of an F7 gene, through e.g., obtaining a biological sample from the individual, and 

25 processing the biological sample to obtain molecular information, where the molecular 
information comprises the presence or absence of a polymorphic region of THBS2, ACE, 
and/or FGB. At least some of the molecular information and health data is then presented as 
digital molecular data and electronically processed to generate a health assessment report. 
The health assessment report is then displayed on an output device. The health assessment 
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report can comprise a digital health profile of the individual. The molecular data can 
comprise protein sequence data, and the molecular profile can comprise a proteomic profile. 
The molecular data can also comprise information regarding one or more of the absence, 
presence, or level, of one or more specific proteins, polypeptides, chemicals, cells, 
organisms, or compounds in the individual's biological sample. The molecular data may also 
comprise, e.g., nucleic acid sequence data, and the molecular profile may comprise, e.g., a 
genetic profile. 

In yet another embodiment, the method of assessing the health of an individual 
further comprises obtaining a second biological sample or a second health information at a 
time after obtaining the initial biological sample or initial health information, processing the 
second biological sample to obtain second molecular information, processing the second 
health information, representing at least some of the second molecular information as digital 
second molecular data and second health information as digital health information, and 
processing the molecular data and second molecular data and health information and second 
health information to generate a health assessment report. In one embodiment, the health 
assessment report provides information about the individual's predisposition for vascular 
disease, e.g., CAD or MI, and options for risk reduction. 

Options for risk reduction comprise, for example, one or more of diet, exercise, one 
or more vitamins, one or more drugs, cessation of nicotine use, and cessation of alcohol use. 
wherein the health assessment report provides information about treatment options for a 
particular disorder. Treatment options comprise, for example, one or more of diet, one or 
more drugs, physical therapy, and surgery. In one embodiment, the health assessment report 
provides information about the efficacy of a particular treatment regimen and options for 
therapy adjustment. 

In another embodiment, electronically processing the digital molecular data and 
digital health data to generate a health assessment report comprises using the digital 
molecular data and/or digital health data as inputs for an algorithm or a rule-based system 
that determines whether the individual is at risk for a specific disorder, e.g., a vascular 
disorder, such as CAD or MI. Electronically processing the digital molecular data and digital 
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health data may also comprise using the digital molecular data and digital health data as 
inputs for an algorithm or a rule-based system based on one or more databases comprising 
stored digital molecular data and/or digital health data relating to one or more disorders, e.g., 
vascular disorders, such as CAD or ML 
5 In another embodiment, processing the digital molecular data and digital health data 

comprises using the digital molecular data and digital health data as inputs for an algorithm 
or a rule-based system based on one or more databases comprising: (i) stored digital 
molecular data and/or digital health data from a plurality of healthy individuals, and (ii) 
stored digital molecular data and/or digital health data from one or more pluralities of 

10 unhealthy individuals, each plurality of individuals having a specific disorder. At least one 
of the databases can be a public database. In one embodiment, the digital health data and 
digital molecular data are transmitted via, e.g., a communications network, e.g., the Intemet, 
to a medical information system for processing. 

A database of stored molecular data and health data, e.g., stored digital molecular 

1 5 data and/or digital health data, from a plurality of individuals, is further provided. A 

database of stored digital molecular data and/or digital health data from a plurality of healthy 
individuals, and stored digital molecular data and/or digital health data from one or more 
pluralities of unhe8ilthy individuals, each plurality of individuals having a specific disorder, 
e.g., a vascijlar disorder, is also provided. 

20 The new methods and systems of the invention provide healthcare providers v^th 

access to ever-grov^ng relational databases that include both molecular data and health data 
that is linked to specific disorders, e.g., vascular disorders. In addition public medical 
knowledge is screened and abstracted to provide concise, accurate information that is added 
to the database on an ongoing basis. In addition, new relationships between particular SNPs, 

25 e.g., SNPs associated with vascular disease, or genetic mutations and specific discords are 
added as they are discovered. 

The invention now being generally described, it will be more readily understood by 
reference to the following examples which are included merely for purposes of illustration of 
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certain aspects and embodiments of the present invention, and are not intended to limit the 
invention. The contents of all references, issued patents and published patent applications 
cited throughout this application, as well as the Figures, Tables, and database references, 
including GenBank Accession Numbers, are incorporated herein by reference. The practice 
5 of the present invention will employ, unless otherwise indicated, conventional techniques of 
cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of the art. Such techniques are explained 
fully in the literature. See, for example, Molecular Cloning A Laboratory Manual, 2nd Ed., 
ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA 

10 Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 
ed., 1984); MuUis et al U.S. Patent Number 4,683,195; Nucleic Acid Hybridization (B. D. 
Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D, Hames & S, J. 
Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); 
Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To 

15 Molecular Cloning (1984); the treatise. Methods In Enzymology (Academic Press, Inc., 
N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 
1987, Cold Spring Harbor Laboratory); Methods In Enzymology^ Vols. 154 and 155 (Wu et 
al eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds.. 
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. 

20 M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

EXAMPLES 

25 Example 1: Detection of polymorphic regions in the human THBS2, ACE, and FGB 

genes 

This example describes the detection of polymorphic regions in the human THBS2, 
ACE, and FGB genes through use of denaturing high performance liquid chromatography 
(DHPLC), variant detector arrays, polymerase chain reaction (PCR), and direct sequencing. 
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Cell lines derived from an ethnically diverse population were obtained and used for 
single nucleotide polymorphism (SNP) discovery by methods described in Cargill, et al 
(1999) Nature Genetics 22:231-238. 

Genomic sequence representing the coding and partial regulatory regions of genes 
5 were amplified by polymerase chain reaction and screened via two independent methods: 
denaturing high performance liquid chromatography (DHPLC) or variant detector arrays 
(Affymetrix™). 

DHPLC uses reverse-phase ion-pairing chromatography to detect the heteroduplexes 
that are generated during amplification of PGR fragments from individuals who are 

10 heterozygous at a particular nucleotide locus within that fragment (Oefiier and Underbill 
(1995) ^w. J. Human Gen. 57:SuppL A266). 

Generally, the analysis was carried out as described in O'Donovan et al. ((1998) 
Genomics 52:44-49). PGR products having product sizes ranging from about 150-400 bp 
were generated using the primers and PGR conditions described in Example 2. Two PGR 

15 reactions were pooled together for DHPLC analysis (4 ul of each reaction for a total of 8 ul 
per sample). DHPLC was performed on a DHPLC system purchased from Transgenomic, 
Inc. The gradient was created by mixing buffers A (O.IM TEAA) and B (O.IM TEAA, 25% 
Acetontitrile). WAVEmaker™ software was utilized to predict a melting temperature and 
calculate a buffer gradient for mutation analysis of a given DNA sequence. The resulting 

20 chromatograms were analyzed to identify base pair alterations or deletions based on specific 
chromatographic profiles. 



Detection of polymorphic regions in the human THBS2, ACEy and FGB genes by SSCP 
Genomic DNA from the cell lines derived from an ethnically diverse population as 
25 described in Cargill, et al (1 999) Nature Genetics 22:23 1 -238, was subjected to PGR in 25 
^il reactions (IX PGR Amplitaq polymerase buffer, 0.1 mM dNTPs, 0.8 \M 5' primer, 0.8 
(iM 3' primer, 0.75 units of Amplitaq polymerase, 50 ng genomic DNA) using each of the 
above described pairs of primers under the following cycle conditions: 94*^C for 2 min, 35 x 
[94°C for 40 sec, 57^C for 30 sec, 72°C for 1 min], 72°C 5 min, 4°C hold. 
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The amplified genomic DNA fragments were then analyzed by SSCP (Orita et ah 
(1989) PNAS USA 86:2766, see also Cotton (1993) MutatRes 285:125-144; and Hayashi 
(1992) Genet Anal Tech Appl 9:73-79). From each 25 p.1 PCR reaction, 3 \i\ was taken and 
added to 7 jiil of loading buffer. The mixture was heated to 94°C for 5 min and then 
5 immediately cooled in a slurry of ice- water. 3-4 |il were then loaded on a 1 0% 

polyacrylamide gel either with 10% glycerol or without 10% glycerol, and then subjected to 
electrophoresis either overnight at 4 Watts at room temperature, overnight at 4 Watts at 4°C 
(for amplifying a 5' upstream regulatory element), or for 5 hours at 20 Watts at 4°C. The 
secondary structure of single-stranded nucleic acids varies according to sequence, thus 

10 allowing the detection of small differences in nucleic acid sequence between similar nucleic 
acids. At the end of the electrophoretic period, the DNA was analyzed by gently overlaying 
a mixture of dyes onto the gel (Ix the manufacturer's recommended concentration of S YBR 
Green I™ and SYBR Green II™ in 0.5 X TBE buffer (Molecular Probes™)) for 5 min, 
followed by rinsing in distilled water and detection in a Fluoroimager 575™ (Molecular 

15 Dynamics™). 

Identification of polymorphic regions in the human THBS2, ACE, or FGB gene by direct 
sequencing of PCR products 

To determine the sequences of the polymorphisms identified, the regions containing 
20 the polymorphisms were reamplified using flanking primers. The genomic DNA was 

subjected to PCR in 50 jal reactions (Ix PCR Amplitaq polymerase buffer, 0.1 mM dNTPs, 
0.8 nM 5' primer, 0.8 |liM 3' primer, 0.75 units of Amplitaq polymerase, 50 ng genomic 
DNA) using each of the pairs of primers under the following cycle conditions: 94°C for 2 
min, 35 x [94X for 40 sec, STC for 30 sec, 72^C for 1 min], 72°C 5 min, 4°C hold. The 
25 newly amplified products were then purified using the Qiagen Qiaquick PCR purification kit 
according to the manufacturer's protocol, and subjected to sequencing using the 
aforementioned primers which were utilized for amplification. 
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Case-Control population 

Several SNPs in each of the THBS2, ACE, and FGB genes were identified. Further 
analysis of the THBS2, ACE, and FGB SNPs included genotyping of the SNPs in large 
patient populations to assess their association with CAD and MI. A total of 352 U.S. 
5 Caucasian subjects with premature coronary artery disease were identified in 15 participating 
medical centers, fulfilling the criteria of either myocardial infarction, surgical or 
percutaneous revascularization, or a significant coronary artery lesion (e.g., at least a 70% 
stenosis in a major epicardial artery) diagnosed before age 45 in men or age 50 in women and 
having a living sibling who met the same criteria. The sibling with the earliest onset in a 

10 Caucasian subset of these families was compared with a random sample of 41 8 Caucasian 
controls without known coronary disease. Controls representing a general, unselected 
population were identified through random-digit dialing in the Atlanta, Georgia area. 
Subjects ranging in age from age 20 to age 70 were invited to participate in the study. The 
subjects answered a health questionnaire, had anthropometric measures taken, and blood 

1 5 drawn for measurement of serum markers and extraction of DN A. 

Statistical Analysis 

All analyses were done using the SAS statistical package (Version 8.0, SAS Institute 
Inc., Cary, N.C.). Differences between cases and controls were assessed with a chi-square 

20 statistic for categorical covariates and the Wilcoxon statistic for continuous covariates. 
Association between each SNP and two outcomes, CAD and MI, was measured by 
comparing genotype frequencies between controls and all CAD cases and the subset of cases 
with MI. Significance was determined using a continuity-adjusted chi-square or Fisher's 
exact test for each genotype compared to the homozygotes wild-type for that locus. Odds 

25 ratios were calculated and presented with 95% confidence intervals. 

Genotype groups were pooled for subsequence analysis of the top loci. Pooling 
allows the best model for each locus (dominant, codominant, or recessive) to be tested. 
Models were chosen based on significant differences between genotypes within a locus. A 
recessive model was chosen when the homozygous variant differed significantly from both 



-89- 



Atty. Docket No.: MMI-004 



the heterozygous and homozygous wildtype, and the latter two did not differ from each other. 
A codominant model was chosen when homozygous variant genotypes differed from both 
heterozygous and homozygous wild-type, and the latter two differed significantly from each 
other. A dominant model was chosen when no significant difference was observed between 
heterozygous and homozygous variant genotypes. 

Multivariate logistic regression was used to adjust for sex, presence of hypertension, 
diabetes, and body mass index using the LOGISTC procedure in SAS. Height and weight, 
measured at the time of enrolhnent, were used to calculate body mass index for each subject. 
Presence of hypertension and non-insulin-dependent diabetes was measures by self-report 
(controls) and medical record confirmation (cases). 

Results: Identified SNPs and Associations with Vascular Disease 
THBS2 

Two SNPs in the THBS2 gene were identified and foimd to be associated with 
vascular disease, e.g., CAD and MI. The first THBS2 SNP, referred to herein as G5755e5, is 
a change from the thymidine (T) to a guanine (G) in the THBS2 gene at residue 3949 of the 
reference sequence GI 307505. The second THBS2 SNP, referred to herein as G5755e9, is a 
change from a thymidine (T) to a cytidine (C) in the THBS2 gene at residue 4476 of the 
reference sequence GI 307505. These SNPs are located within the 3' untranslated region of 
the THBS2 gene. Therefore, they do not result in a change in the amino acid sequence of the 
THBS2 protein (see Table 1, below). 
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Table 1. 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


Gene 


PoIylD 


variant Type of Geno- 
freq. var. types 


Ref. 


Var. 


Genbank 
Accession 
No7nt 
position 


Flanking 
sequence 


SEQID 
NO: 


THBS2 


G5755e5 


.29 


3' 


GG 
GT 
TT 


T 


G 


GI: 307505/ 
nt 3949 


AATGGAA 
CgCAGAG 
ATG 


7 


THBS2 


G5755e9 


.13 


3' utr 


CC 
CT 
TT 


T 


C 


GI: 307505/ 
nt 4476 


TGCAAAT 
GGGTGTG 
AcGCGGT 
TCCAGAT 
GTG 


8 



The variant allele, G, of the THBS2 SNP G5755e5, was previously shown to be 
5 associated with vascular disease, e.g., MI and CAD. Individuals homozygous for the variant 
allele (GG) were at greater than 2-fold decreased odds of having vascular disease. 
Homozygous carriers of the variant allele of the G5755e9 SNP (CC) also showed a ~3-fold 
decreased odds of vascular disease. These two SNPs, G5755e5 and G5755e9, are in 
significant negative linkage disequilibrium with each other (D'=.49 (-), p=.04). The two 

10 SNPs together reveal distinct patterns of risk. Pattern 1 comprises two copies of the variant 
allele of G5755e9 (CC) in combination with two copies of the reference allele of G5755e5 
(TT). Pattern 2 comprises two copies of the reference allele of G5755e9 (TT) and two copies 
of the variant allele of G5755e5 (GG) (see Table 2, below). Patterns 1 and 2 may 
independently mfluence risk of CAD. Individuals who have pattern 1 or pattern 2 are at ~3- 

15 fold decreased odds of vascular disease (odds ratio=0.32, p=.001) (see Table 3, below). 
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2[ 



G5755e9 


G5755e5 


CAD controls 


OR 


P 


cc 


gg/gt 


0 


0 


- 


- 


cc 


tt 


2 


6 


OB 


ns 


tc 


gg 


5 


"3 


1.89 


ns 


tc 


gt 


25 


29 


0.98 


ns 


tc 


tt 


38 


40 


1.08 


ns 


tt 


gg 


9 


30 


0.34 


.01 


tt 


gt 


108 


99 


1.24 


.31 


tt 


tt 


103 


117 


1.00 





ns=non-significant; OR=odds ratio. All odds ratios are relative to last group in table. 



Table 3. 

CAD control 

patterns lor 2 11 36 

other 279 288 

Odds ratio: 0.32 p=.001 



ACE 

A SNP in the ACE gene, identified herein as G765u2, has been identified which is 
also associated with a decreased risk of vascvdar disease, e.g., ML and CAD. The G765u2 
SNP is a change fi-om an adenine (A) to a guanine (G) at nucleotide residue 86408 of the 
ACE reference sequence GI 13027555. This SNP is a "silent" variant. That is, it does not 
resuh in a change in the amino acid sequence of the ACE protein (see Table 4, below). 
Individuals with one copy of an A (the reference allele) and one copy of a G (the variant 
allele) at nucleotide residue 86408 of the ACE reference sequence GI 13027555 (AG 
genotype) are at a decreased risk for CAD and/or MI (CAD odds ratio:0.71; MI odds 
ratio:.66) (see Table 5, below). 

An insertion/deletion polymorphism in the ACE gene was previously associated with 
vascular disease, e.g., associated with a decreased risk for MI. The G765u2 SNP may be 
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found to be in linkage disequilibrium with the previously identified insertion/deletion 
polymorphism. If these two polymorphisms are in linkage disequilibrium (LD), the G765u2 
SNP would act as a marker for the insertion/deletion polymorphism. Regardless of LD 
between these two polymorphisms, the G765u2 SNP represents a novel association with 
vascular disease. 



Table 4. 



1 


2 


3 4 


5 


6 


7 8 


9 


10 


Gene 


PolylD 


variant Type of Geno- 


Ref. 


Var. Genbank 


Flanking 


SEQID 






freq. van 


types 




Accession 


sequence 


NO: 












NoVnt 
















position 






ACE 


G765u2 


silent 


GO 


A 


G GI: 13027555/ GAATGTG 


9 








AG 




nt 86408 


ATGGCCA 










AA 






CgTCCCG 
















GAAATAT 
















GAA 





Table 5. 



Gene 


PolylD 


Geno-type 


Controls 


CAD 


MI CAD Odds Ratio 


I Odds Ratio 










cases 


cases 




ACE 


G765u2 


GG 


78 


78 


43 1.05 (.71, 1.56) 


1.05 (.66, 1.68) 






AG 


185 


124 


64 0.71 (.51, .98) 


0.66 (.44, .95) 






AA 


137 


130 


72 1.00 


1.00 



FGB 

Two SNPs in the FGB gene, identified herein as FGBul and FGBu4, have been 
identified which are associated with decreased risk of vascular disease, e.g., CAD and/or MI. 
The first SNP, FGBul, is a change from a cytidine (C) to a thymidine (T) at nucleotide 
residue 51 18 of the FGB reference sequence GI 182597. This SNP is a silent variant. The 
second SNP, FGBu4, is a change from a guanine (G) to an adenine (A) at nucleotide residue 
8059 in the reference sequence GI 182597. This polymorphism is a missense variation 
which results in a change from an arginine (R) to a lysine (K) in the amino acid sequence of 
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FGB (SEQ ID NO:6) at amino acid residue 478 (see Table 6, below). 

For the FGBul SNP, individuals with two copies of a T (the variant allele) at 
nucleotide residue 51 19 of the FGB reference sequence GI 182597 are at a decreased risk for 
CAD and MI (CAD odds ratio: 0.28; MI odds ratio: 0.43). Individuals with one copy of a T 
and one copy of a C (the reference allele) at nucleotide residue 51 19 of the FGB reference 
sequence GI 1 82597 are also at a decreased risk for CAD and MI (CAD odds ratio: 0.66; MI 
odds ratio: 0.72) (see Table 7, below). 

For the FGBu4 SNP, individuals with two copies of an A (the variant allele) at 
nucleotide residue 8059 of the FGB reference sequence GI 182597 are at a decreased risk for 
CAD and MI (CAD odds ratio: 0.28; MI odds ratio: 0.43). Individuals with one copy of an 
A and one copy of a G (the reference allele) at nucleotide residue 51 19 of the FGB reference 
sequence GI 182597 are also at a decreased risk for CAD and MI (CAD odds ratio: 0.61 ; MI 
odds ratio: 0.66) (see Table 7). 

Two variants in the promoter region of the FGB gene at nucleotide residues -A55 and 
-655, have been previously associated with vascular disease, e.g., CAD and MI. The FGBul 
and FGBu4 SNPs may be found to be in Hnkage disequilibrium with these two previously 
identified SNPs. If these four SNPs are in linkage disequilibrium (LD), the FGBul and 
FGBu4 SNPs would act as markers for the previously identified SNPs. Regardless of LD, 
the FGBul and FGBu4 SNPs represent novel associations with vascular disease. 
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Table 6. 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


Gene 


PolylD 


variant Type of Geno- 
freq. var. types 


Ref. 


Var. 


Genbank 
Accession 
No./nt 
position 


Flanldng 
sequence 


SEQID 
NO: 


FGB 


FGBul 




silent 


TT 
CT 
CC 


C 


T 


GI: 182597/ TGAGACTG 
nt5119 TGAATAGtA 
ATATCCCA 

ACTAAC 


10 


FGB 


FGBu4 




Missense 
(R/K) 


AA 
AG 
GG 


G 


A 


GI: 182597/ CATGGTAC 
nt 8059 TCAATGAa 
GAAGATGA 
GTATGAA 


11 



Table 7. 



Gene 


PolylD 


Geno-type 


Controls 


CAD 


MI 


CAD Odds Ratio 


MI Odds Ratio 










cases 


cases 






FGB 


FGBul 


TT 


19 


5 


4 


0.28 (.10, 76) 


0.43 (.14, 1.28) 






CT 


133 


83 


47 


0.66 (.48, .92) 


0.72 (.48, 1.07) 






CC 


254 


240 


125 


1.00 


1.00 


FGB 


FGBu4 


AA 


19 


5 


4 


0.28 (.10, .76) 


0.43 (.14, 1.30) 






AG 


137 


78 


44 


0.61 (.44, .84) 


0.66 (.44, .99) 






GG 


255 


239 


124 


1,00 


1.00 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 



